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<g) Use of oxidoreductasea In bleaching and/or detergent compositions and their preparation by microorganisms engineered 
by recombinant DNA technology. 

© The structural genes and their regulatory DNA sequences of 
an alcohol oxidase (MOX) and a dihydroxyacetone synthase 
(DHAS) of Hansenula polymorpha have been isolated and the 
nucleotide sequences determined. The invention relates to the 
use of the MOX gene, as well as the use of the regulatory DNA 
sequences of MOX and /or DAS in combination with the MOX 
gene, optionally after modification thereof, or other oxidase 
genes, or other genes, to produce engineered microorganisms, in 
particular yeasts. 

Said engineered microorganisms can produce oxidases or 
other enzymes in yields that allow Industrial application on a large 
scale. 

Moreover, said engineered microorganisms can produce 
oxidases having improved properties with respect to their appli- 
cation in oxidation reactions and /or in bleaching and detergent 
products. 
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USE OF OXIDOREDUCTASES IN BLEACHING AND/ OR DETERGENT 
COMPOSITIONS AND THEIR PREPARATION BY MICROORGANISMS 
ENGINEERED BY RECOMBINANT DNA TECHNOLOGY 

The present invention relates to a process for micro- 
bio logically preparing oxidoreductases , use of these 
enzymes in bleaching and/ or detergent compositions, as 
well as to microorganisms transformed by DNA sequences 
5 coding for an oxidoreductase and optionally for a di- 
hydroxyacetone synthase-enzyme, and H. polymorpha 
alcohol oxidase and/ or dihydroxyacetone synthase 
regulation sequences, the microorganisms being suitable 
for use in the process. 

10 

Oxidoreductases, especially those Which use oxygen as 
electron acceptor, are enzymes suitable for use in 
bleaching and/or detergent compositions in which they 
can be used for the In situ formation of bleaching 
15 agents, e.g. H 2 0 2 # during the washing or bleaching 
process. See for example 

- GB-PS 1 225 713 {Colgate-Palmolive Company), in which 
the use of a mixture of glucose and glucose oxidase 
and other ingredients in a dry powdered detergent 

20 composition has been described, 

- DE-PA 2 557 623 (Henkel & Cie GmbH) , in which the use 
of a C x to C 3 alkanol and alcohol oxidase, or 
galactose and galactose-oxidase, or uric acid and 
uratoxidase, and other ingredients in a dry detergent 

25 composition having bleaching properties has been des- 

cribed, and 

- GB-PA 2 101 167 (Unilever PLC) in which the use of a 
C x to C 4 alkanol and a to C 4 alkanol 

oxidase in a liquid bleach and/ or detergent com- 
30 position has been described, 

wherein the alkanol and the enzyme are incapable of 
substantial interaction until the composition is 
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diluted with water, and/ or has come into contact with 
sufficient oxygen. 

Up to now natural oxidase- enzymes cannot be produced at 
5 a cost price that allows industrial application on a 
large scale, e.g. detergent products. Moreover, the 
oxidase-enzymes have to act under non-physiological 
conditions when used in detergent and bleaching 
products. Further the natural oxidases that have been 

10 investigated for use in detergent compositions are 
accompanied by the natural catalase-enzyme which 
decomposes almost immediately the peroxide(s) formed, 
so that no effective bleaching is obtained. Thus a need 
exists for oxidase-enzymes that are more suitable for 

15 use under the conditions of manufacture and use of 
detergent and bleaching products. 

For an economically feasible production of these 
oxidases it is further required to reach a yield of 
20 these enzymes in fermentation processes in the order of 
that of alcohol oxidase of H. polymorpha , which is up 
to 20% of the cellular protein (van Dijken et al., 
1976) . 

25 One way of finding new microorganisms producing enzymes 
in higher amounts or finding new oxidase-enzymes having 
improved properties is to check all sorts of micro- 
organisms and try to isolate the relevant oxidases, 
which are then checked for their abilities to generate 

30 peroxides and their stabilities under the conditions of 
manufacture and use of detergent and bleaching pro- 
ducts . One can hope that some day a suitable enzyme 
will be found, but the chance of success is unpredict- 
able and probably very low. 

35 

Another way is to apply another trial and error method 
of crossing the natural microorganisms producing these 
oxidases by classical genetic techniques, in the hope 
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that some day one will find a more productive micro- 
organism or a more suitable enzyme, but again the 
chance of success is rather low, 

5 Clearly, a need exists for a method for preparing 

oxidase-enzyraes in higher yield and/ or without the con- 
comitant formation of catalase and/ or having improved 
properties during storage and/ or use in e.g. bleach 
and/ or detergent compositions. The problem of trial and 

10 error can be overcome by a process for preparing an 
oxidase-enzyme by culturing a microorganism under 
suitable conditions, and preferably concentrating the 
enzyme and collecting the concentrated enzyme in a 
manner known per Be , which process is characterized in 

15 that a microorganism is used that has been obtained by 
recombinant DNA technology and which is capable of pro- 
ducing said oxidase-enzyme. 

The microorganisms suitable for use in a process for 
20 preparing an oxidase-enzyme can be obtained by re- 
combinant DNA technology, whereby a microorganism is 
transformed by a DNA sequence coding for an oxidase- 
enzyme (so-called structural gene) together with one or 
more other DNA sequences which regulate the expression 
25 of the structural gene in a particular microorganism or 
group of organisms, either via introduction of an epi- 
somal vector containing said sequences or via a vector 
containing said sequences which is also equipped with 
DNA sequences capable of being integrated into the 
30 chromosome of the microorganism. 

The determination of a structural gene coding for the 
enzyme alcohol oxidase (EC 1.1.3.13) originating from 
H. polymorpha together with its regulatory 5*- and 3*- 
35 flanking regions will be described as an example of the 
invention without the scope of the invention being 
limited to this example. The spirit of the invention is 
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also applicable to the isolation of DNA sequences of 
other oxidase-enzymes such as glycerol oxidase, glucose 
oxidase, D-amino acid oxidase etc.; the incorporation 
of the DNA sequences or modifications thereof into the 
genome of microorganisms or into episomal vectors used 
for transforming microorganisms and the culturing of 
the transformed microorganisms so obtained as such or 
for producing the desired oxidase-enzymes, as well as 
the use of these enzymes in bleaching compositions 
containing them. 



Although the microorganisms to be used can be bacteria, 
e.g. of the genus Bacillus , as well as moulds, the use 
of yeasts is preferred for technological and economical 
reasons. In particular a mould or yeast can be selected 
from the genera Aspergillus , Candida , Geotrichum , 
Hansenula , Lenzites , Nadsonia , Pichia , Poria , 
Polyporus, Saccharomyces , Sporobolomyces , Torulopsis , 
Trichosporon and Zendera , more particularly from the 
species A. japonicus , A. niger , A. oryzae , C. boidinii , 
H. polymorpha , Pichia pastoris and Kloeckera sp. 2201. 
The latter name is sometimes used instead of c. 
boidinii. ~ 



Many ^-utilizing yeasts have been isolated during 
the last decade, and for Hansenula polymorpha and 
Candida boidinii the methanol metabolism has been 
studied extensively (for a review see Veenhuis et al . , 
1983}. 



The first step in this metabolism is the oxidation of 
methanol to formaldehyde and H 2 0 2 catalysed by 
MOX. Formaldehyde is oxidized further by the action of 
formaldehyde dehydrogenase and formate dehydrogenase. 
H 2 0 2 is split into water and oxygen by catalase. 



Alternatively, methanol is assimilated into cellular 
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material. After its conversion into formaldehyde, this 
product is fixed through the xylulose monophosphate 
pathway into carbohydrates. Dihydroxyacetone synthase 
( DHAS ) plays a crucial role in this assimilation pro- 
5 cess. 

The appearance of MOX, formate dehydrogenase, formal- 
dehyde dehydrogenase, DHAS and catalase is subject to 
glucose repression, e.g. on 0.5% glucose. However, syn- 
10 thesis of MOX is derepressed by growth in low concen- 
trations of glucose (0.1%), contrary to the synthesis 
of DHAS , which is still fully repressed under these 
conditions (Roggenkamp et al . , 1984). 

15 Regulation, i.e. the possibility to switch "on" or 
"off" of the gene for the polypeptide concerned, is 
desirable, because it allows for bioraass production, 
when desired, by selecting a suitable substrate, such 
as, for example melasse, and for production of the 

20 polypeptide concerned, when desired, by using methanol 
or mixtures of methanol and other carbon sources . 
Methanol is a rather cheap substrate, so the poly- 
peptide production may be carried out in a very eco- 
nomical way. 

25 

After derepression of the gene coding for alcohol 
oxidase (MOX) by growth on methanol, large microbodies, 
the peroxisomes are formed. While glucose-grown cells 
contain only a small peroxisome, up to 80% of the in- 

30 ternal volume of the cell is replaced by peroxisomes in 
the derepressed state. The conversion of methanol into 
formaldehyde and &2 0 2 as weil as ^ e degradation of 
H 2°2 ** as been shown to occur in these peroxisomes, 
while further oxidation or assimilation of formaldehyde 

35 most probably occurs in the cytoplasm. This process is 
a perfect example of compartmentalization of toxic pro- 
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ducts, of a strong co-ordinate derepression of several 
cellular processes and of the selective translocation 
of at least two of the enzymes involved in this 
process. 

Most of the enzymes involved in the methanol metabolism 
have been purified and characterized (Sahm, 1977, 
Bystrykh et al, 1981). Especially methanol oxidase (EC 
1.1.3.13) has been studied in detail, it is an octamer 
consisting of identical monomers with an M r value of 
about 74 kd and it contains PAD as a prosthetic group. 
Up to now no cleavable signal sequence for trans- 
location could be detected, as concluded from electro- 
elephoresis studies with ±n vivo and in vitro syn- 
thesized products (Roa and Blobel, 1983) or from in 
vi<tro synthesis in the presence of microsomal membranes 
(Roggenkamp et al., 1984). 

Under derepressed conditions, up to 20% of the cellular 
protein consists of MOX. 

Materials and methods 

a ) Microorganisms and cultivation conditions 

Hansenula polymorpha CBS 4732 was obtained from Dr 
J. P. van Dijken (University of Technology, Delft, 
The Netherlands). Cells were grown at 37°C in 1 
litre Erlenmeyer flasks containing 300 ml minimal 
medium (Veenhuis et al., 1978), supplemented with 
0.5% (v/v) methanol or 0.5% (v/v) ethanol as 
indicated. Phage lambda L47.1 and the P2 lysogenic 
I- coli K12 strain Q 364 were obtained from Dr p. 
van der Elsen (Free University of Amsterdam, The 
Netherlands) and propagated as described (Loenen and 
Brammar, 1980) . 



£• CQl1 K12 strains BHB 2600, BHB 2688 and BHB 2690 
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(Hohn, 1979) were obtained from Dr M. van Montagu 
(University of Gent, Belgium), while E. coli K12 
strain JM 101.7118 and the M13 derivatives M13 mp 8, 
9, 18 and 19 were obtained from Bethesda Research 
5 Laboratories Inc. (Gaithersburg , MD, U.S.A.). 

b) Enzymes 

All enzymes used were obtained from Amersham Inter- 
national PLC, Amersham, U.K., except alpha-helicase 
10 which was obtained from Pharm Industrie, Clichy, 

France. Enzyme incubations were performed according 
to the instructions of the manufacturer. 
ATP : RNA adenyl transferase was purified as described 
by Edens et al. (1982). 



15 



c) Other materials 



[ 35 S3 methionine, [alpha- 35 S] dATP, Calpha- 32 P] 
dNTP's, Lalpha- 32 Pj ATP and Cgamma- 32 P] ATP were 
obtained from Amersham International PLC, Amersham, 
20 U.K. 

Nitrobenzyloxy-methyl (NBM) paper was obtained from 
Schleicher and Schuell, and converted into the diazo 
form (DBM) according to the instructions of the 
25 manufacturer. 

Nitrocellulose filters (type HATF) were obtained 
from Millipore. 

30 RNA isolation, fractionation and analysis 

Hansenula polymorpha cells were grown to mid- 
exponential phase, either in the presence of methanol 
or ethanol. The cells were disrupted by forcing them 
35 repeatedly through a French Press at 16 000 psi, in a 
buffer containing 10 mM Tris-HCl pH 8, 5 mM MgCl 2 , 
1% NaCl, 6% para-aminosalicylic acid, 1% sodium do- 
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decyl sulphate (SDS) and 5% phenol. The purification of 
polyadenylated RNA was subsequently performed, as des- 
cribed previously (Edens et al., 1982). One gram cells 
yielded four mg total RNA and 0.1 mg polyadenylated RNA. 
5 Five microgram samples of total RNA or polyadenylated 
RNA were radioactively labelled at their 3* -ends with 
ATP: RNA adenyl transferase and [alpha- 32 Pj ATP, and 
subsequently separated on a 2.5% polyacrylamide gel 
containing 7 M urea (Edens et al,, 1982). For the 

10 preparative isolation of a specific mRNA fraction, 40 
micrograms polyadenylated RNA was mixed with four 
micrograms of labelled polyadenylated RNA and separated 
on the denaturing polyacrylamide gel. The radioactive 
2.4 kb RNA class was eluted from slices of the gel and 

15 freed from impurities by centrif ugation through a 5-30% 
glycerol gradient in 100 mM NaCl, 10 mM Tris-HCl pH 
7.5, 1 mM EDTA and 0.1% SDS for 15 h at 24 000 
rev./min. in a Bectanann centrifuge using an SW 60 rotor 
at 20 °C. The radioactive fractions were pooled and 

20 precipitated with ethanol. Polyadenylated RNA was 

translated in vitro in a rabbit reticulocyte lysate 
according to Pelham and Jackson (1976), using [ 35 S3 
methionine as a precursor. The translation products 
were immuno-precipitated with MOX antiserum as des- 

25 cribed by Valerio et al. (1983)* 

cDNA synthesis 

One third of the RNA fraction, isolated from the poly- 
30 acryl amide gel, was used to procure a radioactive cDNA 
with reverse transcriptase (Edens et al., 1982), Using 
[alpha- 32 P] dATP and [alpha- 32 P] dCTP of a high 
specific activity (more than 3000 Ci/mM) , 20 000 cpm of 
high molecular weight cDNA was formed during 1 h at 
35 42°C in the presence of human placental ribonuclease 
inhibitor. 
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DNA isolation 

Ten g of Hansenula polymorph a cells were washed with 

1 M sorbitol and resuspended in 100 ml 1.2 M sorbitol, 
5 10 mM EDTA and 100 mM citric acid pH 5.8, to which 100 

microliter beta-mercapto-ethanol was added. Cells were 
spheroplasted by incubation with 500 mg alpha-helicase 
for 1 h at 30 °C. Spheroplasts were collected by centri- 
fugation at 4000 rev./min. in a Sorvall GSA rotor, 

10 resuspended in 40 ml 20 mM Tris-HCl pH 8, 50 mM EDTA 

and lysed by adding 2.5% SDS. Incompletely lysed cells 
were pelleted for 30 min. at 20 000 rev./min. in a 
Sorvall SS34 rotor and DNA was isolated from the 
viscous supernatant by centrif ugation using a CsCl- 

15 ethidium bromide density gradient at 35 000 rev./min. 

for 48 h in a Beckmann centrifuge using a 60 Ti rotor. 

2 mg of DNA was isolated with a mean length of 30 kb. 

Preparation of a clone bank in phage lambda L47.1 

20 

150 microgram Hansenula polymorpha DNA was partially 
digested with Sau 3AI and sedimented through a 10-40% 
sucrose gradient in 1 M NaCl, 20 mM Tris-HCl pH 8 and 5 
mM EDTA for 22 h at 23 000 rev./min. in an SW 25 rotor. 
25 The gradient was fractionated and samples of the 

fractions were separated on a 0.6% agarose gel in TBE 
buffer (89 mM Tris, 89 mM Boric acid, 2.5 mM EDTA). 

Fractions that contained DNA of 5-20 kb were pooled and 
30 the DNA was precipitated with ethanol. Phage lambda 

L.47.1 was grown, and its DNA was isolated as described 
by Ledeboer et al. (1984). The DNA was digested with 
Bam HI and arms were isolated by centrif ugation through 
a potassium acetate gradient as described by Maniatis 
35 et al . (1982). Two microgram phage lambda DNA arms and 
0.5 /ug Sau3AI digested Hansenula polymorpha DNA thus 
obtained were ligated and packaged _in vitro using a 
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protocol from Hohn (1979) ♦ Phages were plated on E • 
coll strain Q 364 to a plaque density of 20,000 pfu per 
14 cm Petri dish* Plaques were blotted onto a nitro- 
cellulose filter (Benton and Davis, 1977) and the blot 
5 was hybridized with the radioactive cDNA probe isolated 
as described above. Hybridization conditions were the 
same as described by Ledeboer et al . (1984) and 
hybridizing plaques were detected by autoradiography. 

10 Isolation and partial amino acid sequence analysis of 
alcohol oxidase (MQX) 

Hansenula polymorpha cells grown on methanol were dis- 
integrated by ultrasonif ication and the cell debris was 

15 removed by centrif ugation. The MOX- containing protein 
fraction was isolated by (NH 4 ) 2 S0 4 precipitation . 
(40-60% saturation) . After dialysis of the precipitate, 
MOX was separated from catalase and other proteins by 
ion- exchange chromatography (DEAE-Sepharose) and gel 

20 filtration (Sephacryl S-400) . Antibodies against MOX 
were raised in rabbits by conventional methods using 
complete and incomplete Freund's adjuvants (Difco Lab, 
Detroit, U.S.A.) • Sequence analysis of alcohol oxidase 
treated with per formic acid was performed on a Beckman 

25 sequenator. Identification of the residues was done 

with HPLC. The amino acid composition was determined on 
a Chromaspek analyser (RanX Hilger, U.K.), using 
standard procedures and staining by ninhydrine. The 
carboxy terminal amino acid was determined as described 

30 by Ambler (1972) . 

Chemical synthesis of deoxyoligonucleotides 

Deoxyoligonucleotides were synthesized on a Biosearch 
35 SAM I gene machine, using the phosphite technique 

(Matteucci and Caruthers, 1981). They were purified on 
16% or 20% polyacrylamide gels in TBE. 
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Hybridization with deoxyoligonucleotide probes 

The deoxyoligonucleotides were radioactively labelled 
with T^-polynucleotide kinase and [gcimma- 32 P3 ATP • 
5 The DNA of the MOX clones obtained was digested with 
different restriction enzymes, separated on 1% agarose 
gel and blotted onto DBM paper* Hybridizations were 
performed as described by Wallace et al . (1981). 

10 DNA sequence analysis 

Prom clone 4 (see Example 1) containing the complete MOX 
gene, several subclones were made in phage M13mp-8, -9 
or M13mp-18, -19 derivatives by standard techniques. 

15 Small subclones (less than 0.5 kb) , cloned in two 

orientations, were sequenced directly from both sides. 
Prom the larger subclones, also cloned in two orien- 
tations, sequence data were obtained by an exonuclease 
Bal31 digestion strategy (see Fig. 1). For each of both 

20 cloned orientations the RF Ml 3 DNA is digested with a 
restriction enzyme that preferably cleaves only in the 
middle of the insert. Subsequently, both orientations of 
the clones were cut at this unique site, and digested 
with exonuclease Bal 31 at different time intervals. 

25 Incubation times and conditions were chosen such that 
about 100-150 nucleotides were eliminated during each 
time interval. Each fraction was digested subsequently 
with the restriction enzyme, recognizing the restriction 
site situated near the position at which the sequence 

30 reaction is primed in the M13 derivatives. Ends were made 
blunt end by incubation with T 4 -polymerase and all 
dNTP's, and the whole mix was ligated under diluted 
conditions, thereby favouring the formation of internal 
RF molecules. The whole ligation mix was used to trans- 

35 form to E. coli strain JM 101-7118. From each time 

interval several plaques were picked up and sequenced 
using recently described modifications of the Sanger 
sequencing protocol (Biggin et al., 1983). 
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The isolation of auxotrophic mutants 

LEU-1 (CBS N° 7171) is an auxotrophic derivative of 
M* polymorpha strain NCYC 495 lacking A-isopropylmalate 
5 dehydrogenase activity. The isolation of this mutant 
has been described by Gleeson et al . (1984). 

LR9 (CBS N" 7172) is an auxotrophic derivative of H. 
polymorpha ATCC 34438, lacking orotidine S'-decarbox- 
10 ylase activity. 

For the isolation, all procedures were carried out at 
30 °C instead of 37 °C, which is the optimal temperature 
for growth of this yeast. Yeast cells were mutagenized 

15 with 3% ethylmethanesulphonate for 2 hr (Fink, 1970). The 
reaction was stopped with 6% sodium thiosulphate (final 
concentration) and the solution was incubated for another 
10 rain. Mutagen! zed cells were then washed once with 
H 2 0 and incubated for 2 days on YEPD or YNB supple- 

20 mented with uracil for segregation and enrichment of 

uracil-auxotrophs followed by a 15 hr cultivation on MM 
without nitrogen source. Finally a nystatin enrichment 
was employed for 12 hr on MM with a concentration of 10 
^ug antibiotic per ml. The treated cells were plated 

25 on YNB plates containing 200 ^ug uracil per ml and 
0.8 mg 5-f luoroorotic acid (Boeke et al., 1984). 
Usually 10 6 cells were plated on a single plate. 
Resistant colonies were picked after 3 days of incu- 
bation, replica plated twice on YNB plates to establish 

30 the auxotrophy. From the auxotrophic mutants ura~ 

cells were isolated. Alternatively, 1.5 x 10 6 yeast 
cells were incubated in one ml of YNB liquid medium 
supplemented with 200 ^ug of uracil and 0.8 mg of 5- 
f luoroorotic acid. After incubation of 2 days, the 

35 treated cells were plated on YNB containing uracil, 

replica-plated twice on YNB and analysed as described 
above. 
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Such resistant mutants have been shown to be uracil 
auxotrophs affected at the URA3 or the URA5 locus in S. 
cerevisiae (F. Lacroute, personal communication) . Of 
about 600 resistant colonies of H. polymorpha tested, 
5 5 2 exhibited a uracil phenotype. Since URA3 and URA5 
mutations in S. cerevisiae lack orotidine 5*- 
decarboxylase and orotidine 5' -phosphate pyrophosphor- 
ylase, respectively (Jones and Fink, 1982), the ob- 
tained uracil auxotrophs of H. polymorpha were tested 

10 for both enzymatic activities (Lieberman et al . , 

1955) . Mutants affected in either of the two enzymes 
were found (Table !)• They have been designated odcl 
and oppl mutants, respectively. The odcl mutants ex- 
hibit adequate low reversion frequencies (Table II) and 

15 thus are suitable for transformation purposes by com- 
pi eraent at ion . 

Isolation of autonomous replication sequences (HARS) 
from H . polymorpha 

20 

Chromosomal DNA from H. polymorpha was partially di- 
gested either with Sai l or Bam HI and ligated into the 
single Sai l and Bam HI site of the integrative plasmid 
YIp5, respectively. The ligation mixture was used to 
25 transform E. coli 490 to ampicillin resistance. YIp5 is 
an integrative plasmid containing the URA3 gene as a 
selective marker (Stinchcomb et al . , 1980). 

The plasmid pool of H. polymorpha Sai l clones was used 
30 to transform H. polymorpha mutant LR9. A total of 27 
transformants was obtained being also positive in the 
/3-lactamase assay. From all of them, plasmids could be 
recovered after transformation of E. coli 490 with 
yeast minilysates. Restriction analysis of the plasmids 
35 revealed that most of the inserts show the same 

pattern. The two different plasmids, pHARSl and pHARS2, 
containing inserts of 0.4 and 1.6 kb respectively, were 
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used for further studies (Fig. 2). Both plasmids 
transform H. polymorpha mutant LR9 with a frequency of 
about 500-1*500 transf ormants per /ug of DNA using 
the transformation procedure of intact cells treated 
with polyethyleneglycol . Southern analysis of the H. 
polymorpha transformants after retransforraation with 
pHARSl and pHARS2 recovered from E . coli plasmid pre- 
parations shows the expected plasmid bands and thus 
excludes integration of the URA3 gene as a cause of the 
uracil protrophy. Therefore, we conclude that the HARS 
sequences like ARS1 ( Stinchcomb et al., 1982) allow 
autonomous replication in H. polymorpha . Neither HARS1 
nor HARS 2 enabled autonomous replication in S. cere - 
visiae. HARS1 was sequenced completely as shown in Fig. 
3. 

Estimation of plasmid copy number In H, polymorpha 
trans f ormants 

The copy number of plasmids conferring autonomous rep- 
lication in H . polymorpha either by ARS sequences or by 
HARS sequences was estimated by Southern blot analysis 
(Fig. 4). For comparison, plasmid YRP17 in S. cere - 
visiae (Fig. 4, lanes 6, 7), which has a copy number 
of 5-10 per cell (Struhl et al., 19 79) and the high 
copy number plasmid pRB58 in S. cerevisiae (Fig. 4, 
lanes 4, 5) with about 30-50 copies per cell were 
used. YRP17 is a URA3- containing yeast plasmid, 
bearing an ARS sequence (Stinchcomb et al. # 1982), 
while pRB58 is a 2 yum derivative containing the URA3 
gene (Carlson and Botstein, 1982). A Kluyveromyces 
lactis trans forraant carrying 2 integrated copies of pBR 
pBR322 was used as a control (Fig. 4, lanes 2, 3). The 
intensity of staining in the autoradiogram reveals 
that the plasmid YRP17 in H ♦ polymorpha has practically 
the same copy number as in S. cerevisiae , whereas plas- 
mids pHARS-1 and pHARS-2 show a copy number which is in 
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the range of about 30-40 copies per cell like pBR58 in 
S. cerevisiae . This proves once more the autonomously 
replicating character of the HARS sequence. 



5 Transformation procedures 

Several protocols were used. 

a) H • polymorpha strain LEU-1 was transformed using a 

10 procedure adapted from Beggs (1978). The strain was 

grown at 37 "C with vigorous aeration in 500 ml 
YEPD liquid medium up to an OD^qq of 0*5. The 
cells were harvested, washed with 20 ml distilled 
water and resuspended in 20 ml 1.2 M sorbitol, 25 mM 

15 EDTA pH 8.0, 150 mM DTT and incubated at room tem- 

perature for 15 minutes. Cells were collected by 
centrif ugation and taken up in 20 ml 1.2 M sorbitol, 
0.01 M EDTA, 0.1 M sodium citrate pH 5.8 and 2% v/v 
beta-glucuronidase solution (Sigma 1500000 units/ml) 

20 and incubated at 37 °C for 105 minutes. After 1 hr, 

the final concentration of beta-glucuronidase was 
brought to 4% v/v. For transformation, 3 ml aliquots 
of the protoplasts were added to 7 ml of ice cold 
1.2 M sorbitol, 10 mM Tris-HCl pH 7. Protoplasts 

25 were harvested by centrif ugation at 2000 rpm for 5 

minutes and washed three times in ice cold sorbitol 
buffer. Washed cells were resuspended in 0.2 ml 1.2 
M sorbitol, 10 mM CaCl 2 , 10 mM Tris-HCl pH 7 on 
ice. 2 ^ug of YEP13 DNA - an autonomous repli- 

30 eating S. cerevisiae plasmid consisting of the LEU2 

gene of S. cerevisiae and the 2 micron-ori (Broach 
et al., 1979) - were added to 100 ml of cells and 
incubated at room temperature. 0.5 ml of a solution 
of 20% PEG 4000 in 10 mM CaCl 2 , 10 mM Tris-HCl pH 

35 7.5 was added and the whole mixture was incubated 

for 2 minutes at room temperature. Cells were col- 
lected by brief (5 sec.) centrif ugation in an MSG 
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microfuge set at high speed and resuapended in 0.1 
ml YEPD 1.2 M sorbitol pH 7.0, and incubated for 15 
minutes at room temperature. The cells were plated 
directly by surface spreading on plates containing 
2% Difco agar, 2% glucose, 0.67% Difco yeast 
nitrogen base and 20 mg/1 of each of L-adenine 
Hemisulphate, methionine, uracil, histidine, 
tryptophan, lysine and 1.2 M sorbitol. Leu + 
traneformants appear after 5 days incubation at 37 °C 
with a frequency of 50 colonies/yug DNA, while no 
traneformants appear if no DNA is added. 

Alternatively, H. polymorpha LEU-1 was transformed 
with YEP13, using a procedure adapted from Das et 
al. (1984). Exponentially growing cells were grown 
up to an OD 600 of 0.4, washed in TE buffer (50 mM 
Tris-HCl pH 8.0, 1 mM EDTA) and resuspended in 20 ml 
TE buffer. 0.5 ml cells were incubated with 0.5 ml 
0.2 M Li CI for 1 hr at 30 °C. To 100 ml of these 
cells 4 /ug YEP13 in 20 ml TE buffer was added and 
the sample was incubated for a further 30 minutes at 
30 # C. An equal volume of 70% v/v PEG 4000 was added 
and the mixture was incubated for 1 hr at 30 o C, fol- 
lowed by 5 min. at 42 W C. After addition of 1 ml 
H 2 0, cells were collected by a brief centri- 
fugation as described under a) , washed twice with 
H 2 0 and resuspended in 0.1 ml YEPD 1.2 M sorbitol 
and incubated for 15 minutes at room temperature. 
Cells were plated as described. Leu + transformants 
appear with a frequency of 30/^ug DNA. 

The H. polymorpha URA mutant LR9 was transformed 
with YRP17, a plasmid containing the URA3 gene of S. 
cerevisiae as a selective marker and an autonomously 
replicating sequence ( ARS } for S. cerevisiae 
(Stinchomb et al, 1982). Using the protoplast method 
described by Beggs (1978), 2-5 transformants/ ^ug 
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DNA were obtained. This number was enlarged, using 
the IiiS0 4 method of Ito et al . (1983), up to 15-20 
transformants per ^ug of DNA, However, the best 
procedure was the procedure described by Klebe et 
5 al. (1983) , using intact cells treated with PEG 

4000. Up to 300 transformants were obtained per 
/ug DNA. The LiSO^ procedure, as well as the 
Klebe procedure, was performed at 37 °C. 

10 Transformation of H. polymorpha based on autonomous 

replication of the vector was indicated by two charac- 
teristics: (1) the instability of the uracil" 1 " pheno- 
type. After growth of transformants on YEPD for ten 
generations, more than 99% had lost the ability to grow 

15 on selective medium (Table II). (2) Autonomous repli- 
cation was further ascertained by transforming E. coli 
cells with yeast minilysates and retrans formation of H. 
polymorpha . Subsequent Southern analysis showed the 
presence of the expected plasmid. 

20 

*?• polymorpha LR9 could not be transformed with pRB58, 
or with pHH85, constructed by insertion of the whole 2 
micron circle DNA (Hollenberg, 1982) into the Pst I site 
of the ampicillin gene of plasmid YIPS. YIPS , con- 

25 taining the DNA sequence of HARS1 or HARS2, was trans- 
ferred to H. polymorpha LR9 using the Klebe protocol 
with a frequency of 500-1500 transformants per ^ug of 
DNA. Thus, transformation frequency is 2-5 times higher 
than described above, using the heterologous ARS 1 in 

30 YRP17 of S. cerevisiae . Similarly, the stability of the 
HARS plasmid in transformants is slightly higher than 
the ARS 1 plasmid (Table II). 

Transformation of H. polymorpha by integration of the 
35 URA3 gene from S. cerevisiae 

The URA3 gene of S. cerevisiae shows no homology to the 
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ODC gene in H. polymorpha , as revealed by Southern 
hybridisation of nick- translated YIp5 plasmid DNA to 
chromosomal DNA of H. polymorpha . Therefore, low- 
frequency integration of the URA3 gene at random sites 
5 of the H. polymorpha genome had to be anticipated. 
Transformation of mutant LR9 with the integrative 
vector Ylp5 resulted in 30-40 colonies per ^ug of DNA 
on YNB plates using the polyethyleneglycol method, 
whereas no transf ormants were obtained in the control 
10 experiment using YIp5 for transformation of S. cere- 
visiae mutant YNN27 . Analysis of 38 transf ormants re- 
vealed 4 stable integrants after growth on non- 
selective medium. The integration event was further 
demonstrated by Southern analysis (Pig. 5). 

15 

A second procedure for generating integration of the 
URA3 gene into chromosomal DNA of H. polymorpha was 
performed by enrichment of stable Ura + transf ormants 
from transf ormants carrying plasmid pHARSl. Trans- 

20 formants were grown in liquid YEPD up to a density of 
10 9 cells per ml. An aliquot containing 5 x 10 6 
cells was used to inoculate 100 ml of fresh medium and 
was grown up to a cell density of 10 9 per ml. The 
procedure was repeated until about 100 generations had 

25 been reached. Since the reversion rate -of mutant LR9 is 
2 x 10~ 9 and the frequency of plasmid loss per 10 
generations is 97% in pHARSl transf ormants , the pre- 
dominant part of the Ura + cells after 100 generations 
should be integrants. The Ura + colonies tested were 

30 all shown to maintain a stable Ura + phenotype indi- 
cating an integration of the URA3 gene. This was 
further verified by Southern blot analysis. In 
addition, these data indicate that the integration 
frequency is 5 x 10~ 6 . 
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Example 1 

CLONING OP THE GENE FOR ALCOHOL OXIDASE (MOX) PROM 
HANSENULA POLYMORPHA 

5 



Characterization of polyadenylated RNA 

Total RNA and polyadenylated RNA, isolated from cells 

10 grown on methanol, were labelled at their 3* -termini with 
ATP: RNA adenyl transferase, and separated on a de- 
naturing polyacrylamide gel (Fig. 6). Apart from the rRNA 
bands, two classes of RNA appear in the poly-adenylated 
RNA lane, respectively 1 Kb and 2.3 kb in length. Since 

15 these RNA classes are not found in polyadenylated RNA of 
ethanol-grown cells (result not shown), they obviously 
are transcripts of genes derepressed by growth on 
methanol. The 2.3 kb class can code for a protein of 
700 to 800 amino acids, depending on the length of the 

20 non-translated sequences. Likewise, the 1 kb class 
codes for a protein of 250-300 amino acids. Enzymes 
that are derepressed by growth on methanol and are 700 
to 800 amino acids long, most likely are MOX (Kato et 
al., 1976; Roa and Blobel, 1983) and DHAS (Bystrykh et 

25 al., 1981). Derepressed enzymes in the 250 to 300 amino 
acid range are probably formaldehyde and formate de- 
hydrogenase (Schiltte et al., 1976). The polyadenylated 
RNA was characterized further by in vitro translation 
in a reticulocyte cell free translation system. Two 

30 microliters of the polyadenylated RNA directed protein 
mixture were separated directly on a 10% SDS poly- 
acrylamide gel, while the remaining 18 microliters were 
subjected to immuno-precipitation with antiserum 
against MOX (Fig. 7). Six strong bands dominate in the 

35 total protein mixture, having molecular weights of 
respectively 78kd, 74kd, 58kd, 42kd, 39kd and 36kd. 
Essentially the same molecular weights were found by 



20 



0173378 



Roa and Blobel (1983) in a total cell extract from 
methanol-grown H . polymorpha cells. 

The 74kd protein can tentatively be assigned to the 
5 monomer of MOX, the 58kd protein to the monomer of 

catalase and the 39kd and 36kd proteins to the monomers 
of formaldehyde dehydrogenase and formate dehydrogen- 
ase, respectively. The 78kd polypeptide possibly is 
DHAS, while the 42kd polypeptide remains unidentified. 
10 After immuno-precipitation, both high molecular weight 
proteins react with the MOX antiserum. 

Cloning of the gene for MOX 

15 Although the 2.3 kb mRNA class induced by growth on 

methanol obviously codes for at least 2 polypeptides, 
it seemed a good candidate for screening a Hansenula 
polymorpha clone bank by hybridization. The 5-20 kb 
fraction of partially Sau 3AI digested H. polymorpha DNA 

20 was cloned in phage lambda L47.1. 

Per microgram insert DNA, 300 000 plaques were obtained 
while the background was less than 1:1000. Two Benton 
Davis blots, containing about 20 000 plaques each, were 

25 hybridized with 15 000 cpm of the mRNA-derived cDNA 
probe. After 3 weeks of autoradiography about 40-50 
hybridizing plaques could be detected. All plaques were 
picked up and five were purified further by plating at 
lower density and by a second hybridization with the 

30 cDNA probe. Prom four, single hybridizing plaques (1, 
3, 4, 5) DNA was isolated* The insert length varied 
from 8 to 13 kb. 

Hybridization selection using organic- synthetic DNA 
35 probes 

The sequence of 30 amino acids at the amino terminus of 
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purified MOX was determined (Pig. 8). 

Using the most abundant codon use for the yeast S. 
cerevisiae , a sequence of 14 bases could be derived 
from part of this protein sequence, with only one am- 
biguity. Both probes, indicated in Fig. 4, were syn- 
thesised. In both probes an Eco RI site is present. DBM 
blots were made from the DNA of the MOX clones digested 
with the restriction enzymes BamHI, Eco Rl/ Hin dlll , 
Hin dlll/Sall and Pst l/ Sal l and separated on 1.5% 
agarose gels. After hybridization of the blot with a 
mixture of both radioactively labelled probes, the 
clones 1, 4 and 5 hybridize, while clone 3 does not, as 
shown for the Hindlll/Sall blot in Pig. 9. However, the 
probes did not hybridize with the EcoRl/ Hin dlll digested 
DNA of these clones (result not shown). Since an EcoRI 
site is present in the probes, the hybridizing DNA in 
the clones probably is cut by this enzyme too. 
Consequently the hybridization overlap has become too 
small to allow the formation of stable hybrids. 

Restriction map and sequence analysis 

By comparing restriction enzyme digests and by cross- 
hybridization experiments it was concluded that clones 
1, 4 and 5 covered identical stretches of DNA. 

In order to definitely establish the nature of this 
stretch of cloned DNA the insert of clone 4 was ana- 
lyzed in detail. Hybridization with the amino terminal 
probe showed that the complete MOX gene (ca. 2 kb) was 
present, including 2 kb sequences upstream and 3.5 kb 
downstream (Fig. 10). 

DNA sequence analysis of the smallest EcoRI fragment 
revealed the nucleotide sequence corresponding to the 
amino terminus of MOX as was determined by amino acid 
sequence analysis. 
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For sequence analysis, several fragments were subcloned 
in M13mp8/M13mp9 or M13mpl8/M13mpl9 respectively in two 
orientations, as indicated in Fig. 10. Clones that were 
smaller than 0.5 kb were sequenced directly from both 
sides. The larger clones were cut at the unique re- 
striction sites situated in the middle of the cloned 
fragment, to allow generation oi -exonuc lease Bal 31 
digested subclones as described* in materials and 
methods. Using specific oligonucleotide primers, 
sequences around the restriction suites used for sub- 
cloning and sequences that did not allow an unequivocal 
sequence determinatioh were sequenced once more, using 
the 5.5 kb Baro HI/ Sac I subclone that covers the whole 
sequence. The complete nucleotide sequence is given in 
Fig. 11A and 11B. 

The sequence contains an open reading frame of 2046 
nucleotides that can code x for a protein of 664 amino 
acids. The last codon of the open reading frame codes 
for Phe, which is in agreement with the carboxy ter- 
minus of purified MOX. The amino acid composition 
derived from the DNA sequence encoding this protein, 
and the amino acid composition of purified MOX are 
yit-tually identical (Table III). The only important 
differences involve the serine and threonine residues, 
^ which are notoriously difficult to determine. 

i v . ■ 
The* calculated molecular weight of the protein is 74 050 
Daltoh, which agrees well with the molecular weight of 
74 kd of MOX, as determined on polyacrylamide/SDS gels. 

Codon usage 

In Table IV the codon usage for MOX is given. A bias 
towards the use of a selective number of codons is 
evident. 
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Example 2 

CONSTRUCTION OF A PLASMID, pUR 3105 , BY WHICH THE GENE 
CODING FOR NEOMYCIN PHOSPHOTRANSFERASE , THAT CONFERS 
RESISTANCE AGAINST THE ANTIBIOTIC G 418, IS INTEGRATED 
INTO THE CHROMOSOMAL MOX GENE UNDER REGIE OF THE MOX 
REGULON • 



10 H. polymorph* cells, transformed with either the plas- 
mids YEP 13, YRP 17, pHARS 1 or pHARS 2, were unstable 
and lost their leu + or ura + phenotype already after 
10 generations upon growth under non- selective con- 
ditions. In order to obtain stable transf ormants and to 

15 test the MOX promoter, a plasmid pUR 3105 is construc- 
ted in which the neomycin phosphotransferase gene 
(NEO R ) is brought under direct control of the MOX 
regulon. The construction is made in such a way that 
the first ATG of the NEO R gene is coupled to 1.5 kb 

20 of the MOX regulon. The cloning of such a large regu- 
lon fragment is necessary as shorter fragments, that 
do not contain the -1000 region of the regulon, were 
less efficient. 

25 The NEO R gene was isolated as a 1.1 kb Xma lll- Sal l 

fragment from the transposon Tn5, situated from 35 bp 
downstream of the first ATG up to 240 bp downstream of 
the TGA translational stop codon. To avoid a complex 
ligation mixture, first pUR 3101 is constructed (Fig. 

30 12A), which is a fusion of the far upstream Sall-Xmalll 
(position -1510 to position -1128) fragment of the MOX 
regulon, and the NEO R gene, subcloned on M13mp9. 
Another plasmid is constructed, pUR 3102, in which the 
1.5 kb Sal l- HgiA I fragment of the MOX gene, that covers 

35 nearly the whole MOX regulon, is ligated to a MOX- 

NE0 R adapter (Fig. 12B) sequence and cloned in M13-mp9. 
The 1.2 kb Xma lll fragment of this plasmid is cloned in- 
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to the Xma lll site of pUR 3101, resulting in pUR 3103/ 
which is the exact fusion of the MOX regulon and the 
NEO R gene (Fig. 12C) . The orientation is checked by 
cleavage with HgiA I and Sai l , From the larabda-MOX-4 
5 clone, a Sal l- Sac I fragment is subcloned that reaches 
from the Sai l site, still in the structural MOX gene 
(position 894) , up to the Sac I site, far downstream of 
the structural MOX gene (position 3259) {see Fig. 10). 
This M13mpl9 subclone is called pUR 3104. The plasmid 
10 pUR 3105 is obtained by the direct ligation of the 2.7 
kb Sai l fragment from pUR 3103 into the Sai l site of 
pUR 3104. The orientation is tested by cleavage with 
Smal and SacI. 



15 After cleavage of this plasmid with Hindlll and Sac I 
and the transformation of this cleaved plasmid to H. 
polyinorpha , G 418- resistant colonies are found that 
do not lose their resistance upon growth under non- 
selective conditions for a large number of generations. 
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Example 3 

THE CONSTRUCTION OF pUR 3004, BY WHICH THE GENE CODING 
FOR D-AMINO ACID OXIDASE IS TRANSFERRED TO THE CHROMO- 
5 SOME OF H. POLYMORPHA UNDER REGIE OF THE MOX-REGULON 



D-amino acid oxidase (AAO) is an example of an oxido- 
reductase for the production of which the methylo- 

10 trophic H. polymorpha is extremely suited . It might be 
expected that the enzyme, being an oxidase liXe MOX, is 
translocated to the peroxisomes of the yeast that are 
induced during growth on methanol or a mixture of 
methanol and a fermentable sugar as carbon source and 

15 D-amino acids as the sole nitrogen source. Under these 
conditions the cell will be protected from the H 2 0 2 
produced. Alternatively, AAO can be produced without 
the production of H 2 02# when it is placed under 
regie of the MOX- or DAS-regulon. The AAO production 

20 will be induced by the presence of methanol in the 
medium. 

The amino acid sequence of the AAO enzyme has been pub- 
lished (Ronchi et al., 1981) and the complete gene is 

25 synthesised, using the phosphite technique (Matteuci 
and Caruthers, 1981). The gene is constructed in such 
a way that the optimal codon use for H. polymorpha , as 
derived from the sequence of the MOX gene, is used. 
Moreover, several unique restriction sites are intro- 

30 duced without changing the amino acid sequence, to 
facilitate subcloning during the synthesis. The DNA 
sequence is shown in Fig. 13. The gene is synthesised 
in oligonucleotides of about 5 0 nucleotides in length. 
Oligonucleotides are purified on 16% polyacrylamide 

35 gels. The oligonucleotides that form a subclone are 

added together in ligase buffer (Maniatis et al . , 1982) 
and heated to 7 0° C in a waterbath. The waterbath is 
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slowly cooled to 16 °C and T 4 -ligase is added. After 
two "hours of ligation, the DNA is separated on a 1.5% 
agarose gel and the fragment, having the expected 
length, is isolated from the gel. It is subcloned in 
an M13mpl8 vector cleaved at the respective restriction 
sites situated at the end of the fragment. The gene is 
subcloned in this way in 4 subclones, respectively 
Sal l- Hin dlll (position 39-346), Hin dlll- Xma l (position 
346-589), Xmal-Kpnl (position 589-721) and KpnI-Sall 
(position 721-1044). The Sall-Hindlll and Hindlll-Xmal 
subclones and the Xma l- Kpn l and Kpn -I- Sal l subclones 
are ligated together as two Sal l- Xma l subclones in 
Sal l- Xma l cleaved M13mpl8. These two subclones are 
ligated into a Sai l cleaved M13mp8, resulting in pUR 
3001 (Figs 13, 14A) . The whole sequence is confirmed 
by the determination of the nucleotide sequence using 
the modified Sanger dideoxy sequencing technique 
(Biggin et al., 1983). 

The construction of the integrative plasmid, containing 
the AAO gene is shown in Pig. 14A,B. The nearly com- 
plete AAO gene is placed upstream of the MOX termin- 
ation region, by insertion of the AAO gene-containing 
Sail fragment of pUR 3001, in the unique Sai l site of 
pUR 3104 (see also Fig. 14A) , resulting in pUR 3002. 
The orientation is checked by cleavage with Hin dlll. 
The MOX promoter region is isolated as a 1.4 kb Sall- 
HgiA I fragment from pUR 3102 (Fig. 14A) . This fragment 
is subsequently placed upstream of the AAO gene in pUR 
3002, by ligation to partially Sal I- digested pUR 3002 
in the presence of the HgiA I- Sal l MOX-AAO adapter, 
shown in Fig. 14A. The orientation of the resulting 
plasmid pUR 3003 is checked again by cleavage with 
Hin dlll. This plasmid is integrated into the MOX gene 
after cleavage with SacI and transformation to H. poly - 
morpha cells. Transformants are selected by their 
ability to grow on D-amino acids as nitrogen source in 
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the presence of methanol as inducer. 

As the selection of cells containing the AAO gene is 
not simple, another selection marker is introduced. To 
5 this end, the S. cerevisiae LEU2 gene is integrated 
in between the structural AAO gene and the MOX 
terminater. For this construction, the plasmid pURS 
528-03 is used. This plasmid is derived from pURY 
528-03 described in European patent application 

10 0096910. The construction is shown in Fig. 14C. The 
deleted carboxy terminal LEU2 gene sequence of pURY 
528-0 3 was replaced by the complete carboxy terminal 
LEU2 gene sequence from pYeleu 10 (RatzXin and Carbon, 
1977) and the E . coli lac-lac regulon was eliminated. 

15 Subsequently the Hpa l- Sal l fragment of pURS 528-03 
containing the LEU 2 gene, is blunt end inserted in 
the Sai l site of pUR 3003, situated in between the AAO 
structural gene and the MOX terminater. The orientation 
of the resulting plasmid pUR 3004 can be checXed by 

20 cleavage with Sai l and Sac I . pUR 3004 integrates in the 
chromosomal MOX gene of H. polymorpha after trans- 
formation of the Sacl-cleaved plasmid to a H. poly - 
morpha leu"" mutant. Selected leu + transf ormants are 
integrated in the chromosomal MOX gene, together with 

25 the AAO gene. 
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Example 4 

THE CONSTRUCTION OF pUR 3204, pUR 3205, pUR 3210 ana 
PUR 3211, BY WHICH THE SMALL PEPTIDE HORMONE, THE HUMAN 
GROWTH RELEASING FACTOR, IS EXPRESSED UNDER REGIE OF 
THE MOX-REGULON, EITHER BY INTEGRATION INTO THE CHROMO- 
SOMAL MOX GENE (pUR 3203, pUR 3204), OR BY INTEGRATION 
INTO A HARS1 -CONTAINING PLASMID (pUR 3205) OR BY FUSION 
TO THE MOX STRUCTURAL GENE (pUR 3209, pUR 3210 and pUR 
3211) . 



Human growth hormone releasing factor (HGRF) is a 
small, 44 amino acids long, peptide, that activates the 
secretion of human growth hormone from the pituitary 
glands. HGRF can be used in the diagnosis and treatment 
of pituitary dwarfism in man. Since HGRF has been shown 
to induce growth hormone stimulation in numerous 
species, HGRF might be used in the vetinary field too, 
by stimulating growth of animals and increase of milk' 
production ( Coude et al. , 1984). it is difficult to ob- 
tain HGRF from human sources, but it could very well be 
produced by biotechnological processes, once the gene 
has been cloned and transferred to an appropriate host 
organism. Also, as a general example of the production 
of a peptide hormone by H. polymorpha , the gene for 
HGRF is synthesised in the optimal codon use of H. 
polymorpha and brought to expression in several ways. 

For the construction of pUR 3204 and pUR 3205, the gene 
fragment that codes for the carboxy terminal part of 
the protein is synthesised in DNA oligomers of about 50 
nucleotides in length and subcloned as a Hindlli-Sali 
fragment in Hindlll-Sall cleaved M13mpl8, resulting~Ln 
PUR 3201 (Figs 15, 16A) . This Hindi I I- Sal I fragment is 
subsequently inserted upstream of the MOX terminater in 
Hindlll-Sall cleaved pUR 3104 (Fig. 16A) , resulting in 
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PUR 3202. The MOX promoter is inserted in front of the 
HGRF gene, by insertion of the Sall-HgiAI MOX-promoter 
fragment from pUR 3102 (Fig. i 6A ) in Hindlll cleaved 
PUR 3202, using a HgiAI-Hindlll adapter between the 
MOX-promoter and the HGRF gene (Figs 15, 16A) . The ori- 
entation of the resulting plasmid pUR 3203 is checked 
by cleavage with Sail and HgjLAI . pUR 3203 integrates 
into the chromosomal MOX gene of H. polymorpha after 
transformation of the SacI cleaved plasmid. Trans- 
formants are selected on immunological activity. pU R 
3203 is cleaved with Sail, to insert the Sall-Hpal 
fragment of pURS 528-03 (Fig. 16B) that contains the 
LEU2 gene. The orientation of this gene in P UR 3204 is 
checked by cleavage with Hindlll and EcoRI. pU R 3204 
integrates into the chromosomal MOX gene of H. poly- 
morpha after transformation of the SacI cleaved plasmid 
(Fig. 16B) to a leu" H. polymorpha mutant. Selection 
on on leu + transf ormants . A plasmid, called pUR 3205, 
that replicates autonomously in H . polymorpha and con- 
tains the HGRF gene, is obtained by insertion of the 
EcoRI, partially Hindlll cleaved 4 kb long fragment of 
PUR 3203, containing the HGRF gene inserted in between 
the MOX-promoter and terminates into partially Hindin- 
EcoRX cleaved pHARSl (Figs 2, 16C) . The construction of 
PUR 3205 is checked by cleavage with Hindlll. 

The production of small peptides as HGRF by micro- 
organisms is often unstable as a result of enzymic 
degradation ( Itakura et al., 1977). Fusion to a protein 
like MOX, and subsequent transport to the peroxisomes, 
could prevent degradation. Therefore, we decided to in- 
sert the HGRF gene into the unique Kpn l site at 
position 1775 (amino acid 591, Figs 10, 11) of the MOX 
structural gene. The HGRF gene is synthesised again in 
DNA oligomers of 50 nucleotides in length, but now as 
two Kpnl-Hindlll subclones that are cloned as a com- 
plete HGRF structural gene in M13mpl9, cleaved with 
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Kpnl (plasmid pUR 3206, Figs 17, 16D) . Moreover, the 
ATG triplet coding for the internal methionine of HGRP 
at position 27 ( Coude et al., 1984) (position 82 of the 
DNA sequence) is converted into a TGT triplet coding 
for cysteine. This does not alter the HGRF activity es- 
sentially, and facilitates the cleavage of HGRF from 
the fusion protein by CNBr cleavage (Itakura et al . , 
1977). From phage lambda MOX-4 (Fig. 10 
Sphl (position -491) -Kpnl fragment is isolated and in 
serted into Sphl -Kpnl cleaved M13mpl9. This results in 
PUR 3 207. pUR 3 206 is cleaved with Kpnl and the HGRF 
gene is inserted into the Kpn l site of pUR 3207, 
resulting in pUR 3 208. The orientation is checked by 
direct sequence analysis on the single- stranded DNA of 
PUR 3208. Subsequently the downstream part of the MOX 
gene, from the unique Kpnl site up to the Sacl site, 
is isolated as a 1.5 kb fragment from phage lambda 
MOX-4 and inserted into Sacl - partially Kgnl cleaved 
PUR 3208. The orientation of the resulting plasmid pUR 
3209 is checked by digestion with Kpnl. pUR 3209 in- 
tegrates into the chromosomal MOX gene of H. polymorpha 
after transformation of the Sacl, Sph l cleaved plasmid. 
Selection on immunological activity. 

This MOX-HGRF fusion gene is inserted into pHARSl by 
isolation of the whole fusion gene from partially 
Hindll I, partially EcoRI cleaved pUR 3209, into EcoRl 
partially Hindlll cleaved pHARSl . This results in pUR 
3210, which replicates in H. polymorpha after trans- 
formation (Fig. 16E) . Alternatively, the LEU2- 
containing Sall-Hpal fragment of pURS 528-03 is in- 
serted into the blunt-ended Kpnl site of the HGRF gene, 
located at the carboxy terminus of the encoded protein, 
after partial Kpnl cleavage of pUR 3 209. The resulting 
plasmid pUR 3211 integrates into the chromosomal MOX 
gene of H. polymorpha , after transformation of the 
Sacl, Sph l cleaved plasmid (Fig. 16F) . 
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Discussion 



From the length of the open reading frame, from the 
similarity in the amino acid composition of purified 
MOX and the DNA derived protein sequence and from the 
identical 30 N-terminal amino acids, it is concluded 
that the complete gene for MOX from the yeast Hansenula 
polymorpha has been cloned, its calculated molecular 
weight agrees well with the molecular weight determined 
on SDS polyacrylamide gels. Apart from the coding 
sequence, more than 1200 bp has been sequenced from 
both the 5 1 - and the 3 1 -non-coding regions, reaching 
from the Sai l site upstream of the coding sequence, up 
to the Sad site downstream. The gene appears not to be 
interrupted with intervening sequences. 

The protein is not transcribed in the form of a pre- 
cursor. Based on the determination of the molecular 
weight, N-terminal signal sequences could not be 
detected in earlier studies of Roa and Blobel (1983) or 
Roggenkamp et al . (1984) as well. In similar studies, 
it was suggested that also the rat liver peroxisomal 
enzymes uricase (Goldman and Blobel, 19 78) and catalase 
(Goldman and Blobel, 1978; Robbi and Lazarow, 1978) do 
not contain a cleavable N-terminal signal peptide. 
However, as discussed by these authors, proteolytic 
degradation could possibly explain the lack of the 
detection of such a signal sequence. 

Our sequence results definitely prove that for trans- 
location of this protein to the peroxisome, a cleavable 
N-terminal signal sequence is not required. Such a 
translocation signal may well be situated in the 
internal sequence of the mature protein, as is the case 
for ovalbumine (Lingappa et al . , 1979). Inspection of 
the protein sequence reveals the amino acid sequence 
Gly X Gly Y Z Gly (amino acids 13-18), which is charac- 
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teristic for FAD- (flavin adenine dinucleotide)- 
containing enzymes (Ronchi et al., 1981). 

The isolation of the MOX gene described above gives a 
way how to determine the DNA sequence coding for MOX 
and the amino acid sequence of the MOX enzyme. 

Similarly, the DNA sequences and amino acid sequences 
belonging to other oxidase-enzymes can be isolated and 
determined. The knowledge of the MOX gene sequence can 
be used to facilitate the isolation of genes coding for 
alcohol oxidases or even other oxidases. By comparing 
the properties and the structure of enzymes one can 
probably establish structure function and activity 
relationships. One can also apply methods as site- 
directed mutagenesis, or shortening or lengthening of 
the protein coding sequences, modifying the corres- 
ponding polypeptides, to select oxidase-enzymes with 
improved properties, e.g. with increased alkali 
stability, improved production, or oxidase-enzymes 
which need a substrate which is more compatible with 
detergent products. 

Besides the isolation and characterization of the 
structural gene for MOX from the yeast H. polymorpha , 
also the isolation and characterization of the struc- 
tural gene for DHAS from the yeast H. polymorpha has 
been carried out in a similar way. 

The DNA sequence of DAS is given in Fig. 18A-18C. A 
restriction map is given in Fig. 19. The amino acid 
composition calculated from the DNA sequence of DAS ap- 
peared to be in agreement with the amino acid com- 
position determined after hydrolysis of purified DHAS. 
The DHAS enzyme catalyses the synthesis of dihydroxy- 
acetone from formaldehyde and xylulose monophosphate. 
This reaction plays a crucial role in the methanol- 
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assimilation process (cf. Veenhuis et al . , 1983). 

As described before, the synthesis of MOX and DHAS is 
subject to glucose repression. It has now been found 
that higher levels of MOX are reached when using 
glucose/methanol mixtures as substrates instead of 
0*5% (v/v) methanol. Under the former conditions up to 
30% of the cellular protein consists of MOX, compared 
with up to 20% under the latter conditions. 

It was considered that in the regulons of MOX and DAS 
sequences must exist that play a decisive role in the 
regulation of repress ion/ derepression by glucose or of 
the induction by methanol. Some homology therefore 
might be expected. 

A striking homology of the " TATA- boxes 11 has been found, 
both having the sequence C TAT AAA T A • No other 
homologies in the near upstream region of the MOX and 
DAS regulons have been found. Unexpectedly, a detailed 
study of both regulons has shown a remarkable homology 
of the regulons for MOX and DAS in the region about 
1000 bp upstream of the translation initiation codon. 
A practically complete consecutive region of 65 bp in 
the regulon of MOX is homologous to a 139 bp region in 
the DAS regulon, interspersed by several n on- homo logo us 
regions (see Fig. 20). A similar homology is not found 
in any other region of both genes, that are over 4 kb 
in length including their upstream and downstream 
sequences. It is suggested that these homologous 
sequences play a role in the regulation of both genes 
by glucose and methanol. Transformation studies with 
vectors containing as regulon the first 500 bp upstream 
of the ATG of the structural gene of MOX, showed that 
this shortened MOX-regulon gave rise to a relatively 
low expression of the indicator gene beta-lactamase . 
Indicator genes are genes which provide the yeast with 
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properties that can be scored easily, e.g. the gene 
for neomycin phosphotransferase giving resistance to 
the antibiotic G 418 (cf. Watson et al . , 1983) or an 
auxotrophic marker such as leucin. 

The fact that the far upstream homologous regions in 
the MOX and DAS genes have different interruptions and 
the fact that DAS is repressed at 0.1% glucose and MOX 
is not, suggest that these homologous regions are of 
importance to the repr es s ion-derepre s s ion by glucose 
and/or the induction of the expression in the presence 
of methanol. This assumption has been found correct in- 
deed, and the presence or absence of these homologous 
regions can therefore be important for specific appli- 
cations. For example, if the -1052 to -987 region of 
the MOX gene or the -1076 to -937 region of the DAS 
gene is important for the induction of MOX or DAS by 
methanol, the presence of these regions is required for 
the expression of MOX or DAS and/ or for the induction 
of other enzymes by methanol. Another example might be 
the removal of the regions to avoid repression by 
glucose, which is needed for the expression of genes 
coding for proteins other than MOX and DHAS under 
influence of the MOX and/ or DAS regulatory regions with 
glucose as a carbon source. 

Thus one aspect of the present invention relates to the 
isolation and complete characterization of the struc- 
tural genes coding for MOX and DHAS from the yeast H. 
polymorpha . It further relates to the isolation and 
complete characterization of the DNA sequences that 
regulate the biosynthesis of MOX and DHAS in H. poly - 
morpha , notably the regulons and terminaters. 

Moreover, it relates to combinations of genes coding 
for alcohol oxidase or other oxidases originating from 
H. polymorpha strains other than H. polymorpha CBS 
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4732, or Hansenula species other than H. polymorpha , or 
yeast genera other than Hansenula , or moulds, or higher 
eukaryotes, with the powerful regulon and terminater 
of the MOX gene from H . polymorpha CBS 473 2. These 
combinations may be located on vectors carrying amongst 
others an autonomously replicating sequence originating 
from *?' polymorpha or related species or minichromo- 
somes containing centromers, and optionally selection 
marXer(s) and telomers. These combinations may also be 
integrated in the chromosomal DNA of H. polymorpha . 

Furthermore it relates to combinations of the powerful 
regulon or parts of it and terminaters of the MOX 
and/or DAS and - by site-directed mutagenesis or other 
methods - changed structural genes coding for alcohol 
oxidase or another oxidase. These changed structural 
genes may be located on episomal vectors, in mini- 
chromosomes or integrated in the chromosomes of H. 
polymorpha , H • wingeii , H • anomala , and s. cerevisiae 
or in other yeasts. 

Besides this, the present invention relates to com- 
binations of the regulon and terminater of the MOX 
and/ or DAS gene of H • polymorpha with structural genes 
coding for other proteins than oxidases. 

A very important and preferred embodiment of the in- 
vention is a process for preparing a polypeptide, such 
as a protein or an enzyme, by culturing a microorganism 
under suitable conditions, optionally concentrating the 
polypeptide and collecting same in a manner known per 
se, characterized in that a microorganism is used that 
has been obtained by recombinant DNA technology and 
caries a structural gene coding for the polypeptide 
concerned, the expression of which is under the control 
of a regulon, comprising a promoter and at least either 
the -1052 to -987 region of the MOX gene of Hansenula 
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polymorpha CBS 4732, or the -1076 to -937 region of the 
DAS gene of Hansenula polymorpha CBS 4732, or a corres- 
ponding region of other methylotrophic moulds or 
yeasts, or an effective modification of any of these 
5 regions. 

Surprisingly, it has been observed by the present in- 
ventors that the regions concerned, which are shown in 
Fig. 20 and are referred to herein as the -1000 regions 

10 of the MOX and DAS genes, are of crucial importance for 
the expression of the structural gene concerned. Ex- 
periments performed with recombinants containing the 
MOX regulon from which this region was eliminated 
showed a low level of expression. Therefore, use of a 

15 regulon comprising such -1000 region, or an effective 
modification thereof, i.e. any modification which does 
not result in a significant mutilation of the function 
of said region, makes it possible to realize production 
of a relatively high amount of the desired polypeptide. 

20 

A preferred embodiment of this process according to the 
invention is characterized in that the structural gene 
concerned has been provided with one or more DNA 
sequences coding for amino acid sequences involved in 

25 the translocation of the gene product into the per- 
oxisomes or equivalent microbodies of the microbial 
host. Translocation of the produced polypeptide into 
the peroxisomes or equivalent microbodies improves 
their stability, which results in a higher yield. For 

30 certain kinds of polypeptides, in particular oxidases, 
such translocation is imperative for survival of the 
microbial host, i.e. to protect the host against the 
toxic effects of the hydrogen peroxide produced when 
the microbial host cells are growing on the substrate 

35 of the oxidase. If the oxidase concerned does not 

contain addressing signals which are functional in the 
microbial host used in the production process, one 
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should provide the structural gene with sequences 
coding for host specific addressing signals, for 
example by adding such sequences or by substituting 
these for the original addressing sequences of the 
gene. Production of a fused polypeptide, in which the 
fusion partner carries suitable addressing signals, is 
another possibility. In case methylotrophic yeasts are 
used in the production process, it is preferred that 
the DNA sequences consist of the MOX gene or thos parts 
thereof which are responsible for MOX translocation 
into the peroxisomes or microbodies. 

Finally, this aspect of the present invention is re- 
lated to the synthesis of MOX originating from H. poly- 
morpha in other yeasts. 

Some microorganisms with the potential of producing 
alcohol oxidases are summarized below. 

Yeasts producing alcohol oxidases 

(Taxonomic division according to Lee and Komagata, 1980) 

Group 1 Candida boidinii 

Group 2a Hansenula phllodendra 
Pichia lindnerii 
Torulopsis nemodendra 

" pinus 

" sonorensis 
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Group 2b Candida cariosilignicola 
Hansenula glucozyma 
henricii 



minuta 



5 



ii 



nonfermentans 



ii 



polymorpha 
wickerhamii 



Pichia pinus 



trehalophila 



10 



Group 2c Candida succiphila 

Torulopsis nitratophila 



Group 3 Pichia cellobiosa 

15 

Group 4 Hansenula capsulata 
Pichia pastoris 
Torulopsis molischiana 

20 Moulds producing alcohol oxidases: 



25 

Among the oxidases other than alcohol oxidases, the 
most interesting are: 

- glycerol oxidase, 

- aldehyde oxidase, 
30 - amine oxidase, 



Lenzites trabea 



Polyporus versicolor 

" obtusus 
Poria contigua 
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- aryl-alcohol oxidase, 

- amino acid oxidase, 

- glucose oxidase, 

- galactose oxidase, 

- sorbose oxidase, 

- uric acid oxidase, 

- chloroperoxidase, and 

- xanthine oxidase. 
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Combinations of the powerful regulons and terminaters 
of the MOX and DAS genes from H ♦ polymorpha and struc- 
tural genes for oxidases may be combined with one or 
more DNA sequences that enable replication of the 
5 structural gene in a particular host organism or group 
of host organisms, for example autonomously replicating 
sequences or centromers (and telomers) originating from 
H. polymorpha , to suitable vectors that may be trans- 
ferred into H. polymorpha and related yeasts or other 
10 microorganisms . 



H. polymorpha mutants LEU-1 and LR9, mentioned on page 
12 of this specification, were deposited at the 
Centraalbureau voor Schimmelcultures at Delft on 15th 
July, 1985, under numbers CBS 7171 and CBS 7172, res- 
pectively. 

The above description is followed by a list of refer- 
ences, claims, Tables, Legends to Figures and Figures. 
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TABLE I 

Activities of orotidine 5 '-phosphate decarboxylase and 
orotidine 5 •-phosphate pyrophosphorylase in H. poly - 
5 morpha mutants requiring uracil for growth. 



Activity (%) a 

Strain/ Reversion 



Genotype rate orotidine 5'- Orotidine 5- 

10 phosphate phosphate 

d ecarboxyl as e pyr ophosphory 1 as e 



Wild type - 100 100 

15 LR 9/odcl <2 x 10 9 <1 106 

MR 7/odcl 6 x 10 7 <1 71 

NM 8/odcl 3 x 10 8 <1 105 

CLK 55/oppl n.e. b 90 <l 

CLK 68/oppl n.e. 82 <1 

20 YNN 27/ura3 n.e. 0 n.e. 



Strains were grown in YEPD until late exponential 
phase. Extraction of cells was performed with glass 
25 beads using a Braun homogenizer. Protein was estimated 
by the optical density at 280 nra. 

a ) Expressed as the percentage of wild type activity. 
30 b ) Not estimated. 



Hi 

TABLE II 
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Transformation of uracil-requiring mutants of H. poly - 
morpha 



O T- A aJLil 


ir J_ a SIQ _Ltl 


ildlloXUlJUaui. Ull 

frequency 3 


Q-t- A"h ill + 
(%) 


transformed 
DNA 


LR 9 


YRP17 


2.2 x 10 2 


<1 


Autonomous 
replication 


LR 9 


pHARSl 


1.5 x 10 3 


2 


Autonomous 
replication 


LR 9 


pHARS2 


4.6 x 10 2 


1.5 


Autonomous 
replication 


LR 9 


YIP5 


3 (38) c 


105 


Integration 


LR 9 


pRB58 


0 






LR 9 


pHH85 


0 






YNN 27 


YIPS 


0 







10 



15 



20 



25 



a) 



Expressed as total number per ^ug of DNA. Intact 
cells treated with polyethyleneglycol were used for 
transformation as described in Materials and Methods* 



b ) Expressed as the percentage of remaining uracil 

prototrophs after growth on YEPD for ten generations, 



30 c ^ Number in parentheses indicates the amount of mini- 
colonies containing free plasmid YIPS. 
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TABLE III 
Amino acid composition of MOX 
5 Amino Acid DNA sequence Hydrolysate a ) 





pup 


o± 




3 2 




LEU 


47 








ILE 


34 




o*t 


10 




1 ^ 

Xa 




11 




VAT. 






43 




SER 


*»o 




oo a) 




J-\ 7"> O 

PRO 


43 




42 




THR 


44 




38 


15 


ALA 


47 




50 




TYR 


27 




27 




HIS 


19 




21 




GLN 


13 








GLU 


36 


3 


51 


20 


ASN 


32 








ASP 


50 


3 


84 




LYS 


35 




38 




CYS 


13 




12 




TRP 


10 




. b) 


25 


ARG 


36 




36 




GLY 


50 




53 



a) Hydrolysis was performed for 24 
30 b) Not determined. 
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TABLE IV 

Comparison of preferred codon usage in S. cerevisiae , 
H . polyroorpha and E. coli 



Saccharomyces 



ALA GCU, GCC 



10 



15 



20 



25 



SER 
THR 
VAL 

ILE 
ASP 
PHE 
TYR 
CYS 
ASN 
HIS 
GLU 
GLY 

GLN 
LYS 
PRO 
LEU 
ARG 



UCU, UCC 
ACU, ACC 
GUU, GUC 

AUU, AUC 

GAC 

UUC 

UAC 

UGU 

AAC 

CAC 

GAA 

GGU 

CAA 
AAG 
CCA 
UUG 
AGA 



not 



Hansenula 
MOX 

GCC 

UCC, UCG 
ACC 

GUA not used, 
no clear pref. 

AUC, AUU 
GAC 
UUC 
UAC 

no clear pref. 

AAC 

CAC 

GAG 

GGC practically 
used, no clear pref 

CAG 
AAG 

CCU, CCA 
CUG, CUC 
AGA 



E . coli 

GCC not used, 
no clear pref. 

UCU, UCC 
ACU, ACC 
GUU, GUA 

AUC 
GAC 
UUC 
UAC 

no clear pref, 

AAC 

CAC 

GAA 

GGU, GGC 

» 

CAG 
AAA 
CCG 
CUG 
CGU 
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Legends to Figures 

Fig. 1. The exonuclease Bal 31 digestion strategy used 

in sequencing specific MOX subclones. The frag- 
5 ment X-Y subcloned in M13mp-8 or -9, -18 or -19 

is cut at the unique restriction site Z. The 
DNA molecule is subjected to a time- dependent 
exonuclease Bal31 digestion. The DNA fragment 
situated near the Ml 3 sequencing primer is 

10 removed using restriction enzyme Y; ends are 

made blunt end by incubation with T4-DNA 
polymerase and then ligated intramolecularly . 
Phage plaques are picked up after trans- 
formation and the fragment is sequenced from 

15 site Z in the direction of site X. Using the 

M13 derivative with a reversed multiple cloning 
site, the fragment is sequenced from site Z in 
the direction of site X. 

20 Fig. 2. Alignment of pHARS plasmids derived by in- 
sertion of HARS fragments into the single Sai l 
site of YIp5. 

Fig. 3. The complete nucleotide sequence of the HARS-1 
25 fragment. 

Fig. 4. Estimation of copy number by Southern hybrid- 
ization of H. polymorpha transf ormants . An 
aliquot of 8 and 16 ^ul of each probe was 

30 electrophoresed. Lane 1, phage lambda DNA di- 

gested with Hindlll and Eco RI . Lanes 2,3 trans- 
formant of K. lactis containing two copies of 
integrated plasmid, digested with Hin di! I (M. 
Reynen, K. Breunig and CP. Hollenberg, un- 

35 published); lanes 4-7, YNN 27, transformed with 

pRB58 (4-5) and YRP17 (6-7) digested with Eco RI 
respectively; lanes 8,9, LR9 transformed with 
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YRP17 digested with EcoRI ; lanes 10,11, LR9 
transformed with pHARS2 digested with Hin dlll ; 
lanes 12,13, LR9 transformed with pHARSl di- 
gested with Eco RI , 

5 

Fig, 5, Autoradiogram of Southern blots of DNA from H . 
polymorpha mutant LR9 transformed by inte- 
gration of plasmid YIp5. Lane 1, phage lambda 
DNA, digested both with Hindlll and EcoRI; lane 

10 2, pHARS-1, undigested; lanes 3-5 and lanes 6,7 

show DNA from 2 different transformants, Lane 
3, undigested; lane 4, digested with EcoRI; 
lane 5, digested with Pvu II ; lane 6, digested 
with Eco RI ; lane 7, digested with Pvu II ; lane 

15 8, plasmid YIp5, digested with EcoRI. Nick- 

translated YIp5 was used as a hybridization 
probe . 

Fig. 6 Electrophoresis of 32 p-labelled RNA from 
20 Hansenula polymorpha , purified once (lane A) or 

twice (lane B) on oligo( dT) cellulose . Electro- 
phoresis was performed on a denaturing 7 M urea 
2.5% polyacrylamide gel. The position of the 
yeast rRNA's and their respective molecular 
25 weights are indicated by 18S and 25S. The 2.3 

kb band, that can be seen in lane B, was con- 
verted into a cDNA probe which was subsequently 
used to isolate MOX and DHAS clones from the 
Hansenula polymorpha clone bank. 

30 

Fig. 7 35 S-labelled proteins obtained after in vitro 
translation of methanol derepressed, Hansenula 
polymorpha mRNA with a rabbit reticulocyte 
lysate. Either 2 microliters of the total 
35 lysate (lane A) or an immuno-precipitate of the 

remaining 18 microliters using a MOX specific 
antiserum (lane B) were separated on an 11.5% 
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SDS-polyacrylamide gel. A mixture of proteins 
with known molecular weights was used as 
markers. 

8. The N- terminal sequence of purified MOX, as 
determined on a Beckman sequenator. The two 
probes that could be derived from the sequence 
Pro-Asp-Gln-Phe-Asp, using Saccharomyces pre- 
ferred codons, are indicated. 

9. Hybridization of a DBM blot of Hindlll/Sall cut 
MOX clones. The DNA was separated on a 1.5% 
agarose gel (Fig. 9A) and the blot was hybrid- 
ized to a mixture of both MOX-derived synthetic 
DNA probes (Fig. 8). Only one band of clones 1, 
4 and 5 hybridize (Fig. 9B), indicated by an 
arrow in Fig. 9A. Lane M: molecular weight 
markers as indicated. Lane A, B, C and D: 
clones 1, 3, 4 and 5, respectively. Lane E: 
lambda L47.1. 

10. Restriction map for MOX clone 4. Only relevant 
restriction sites are indicated that have been 
used for subcloning and sequencing of the MOX 
gene. The open reading frame, containing the 
structural MOX sequence, and the Ml 3 subclones 
made are depicted. Restriction sites used are: 
B= BamHI , Ej m Eco RI, E y = EcoRV, p = 

PstI, SI = Sai l, Sc m Sad , St * Stu I, H = 
Hindlll, Sp - S£hl, K - Kpnl, Hg - HgiAI and 
X - Xmal. 

11A,B. The nucleotide sequence of the MOX structur- 
al gene and its 5'- and 3* -flanking sequence. 

12A,C. The construction of plasmid pUR 3105 by 

which the neomycin phosphotransferase gene 
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integrates into the chromosomal MOX gene of 
H . polymorpha . 

Fig. 12B. Promoter MOX-neomycin phosphotransferase 
5 adapter fragments. 

Fig. 13. The DNA sequence of the AAO gene, derived from 
the published amino acid sequence. The gene 
is synthesised in the optimal codon use for 

10 H. polymorpha in oligonucleotides of about 50 

nucleotides long. Restriction sites, used for 
subcloning are indicated. The Hgi AI- Sal l frag- 
ment forms the adapter between the structural 
AAO gene and the MOX promoter. The trans- 

15 lational start codon (met) and stop codon 

(***) are indicated. The structural sequence 
is numbered from 1 to 1044, while the MOX 
promoter is numbered from -34 to -1. 

20 Fig. 14A. The construction of pUR 3003, by which the 

AAO gene integrates into the chromosomal MOX 
gene of H. polymorpha . Selection on activity 
of the AAO gene. 

25 Fig. 14B. The construction of pUR 3004, by which the 

AAO gene integrates into the chromosomal MOX 
gene of a H. polymorpha leu" derivative. 
Selection on leu + . 

30 Fig. 14C. The construction of pURS 528-03. Owing to the 
removal of the pCRl sequence and the double 
lac UV5 promoter, this plasmid is about 2.2 
Xb shorter than pURY 528-03. 
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Fig. 15. The DNA sequence of the HGRF gene, derived 
from the published amino acid sequence. The 
gene is synthesised in the optimal codon use 
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for H. polymorpha in oligonucleotides of about 
50 nucleotides long, Hgi AI, Hindlll and Sai l 
sites are used for subcloning. The Hgi AI- 
Hindi II fragment forms the adapter between the 
5 structural HGRF gene and the MOX promoter. The 

translational start codon (met) and stop codon 
(***) are indicated. The structural sequence 
is numbered from 1 to 140, while the MOX 
promoter is numbered from -34 to -1. 

10 

Pig. 16A. The construction of pUR 3203, by which the 
gene coding for HGRF integrates into the 
chromosomal MOX gene of H. polymorpha . 
Selection on immunological activity of HGRF. 

15 

Fig. 16B. The construction of pUR 3204, by which the 
gene coding for HGRF integrates into the 
chromosomal MOX gene of a H. polymorpha leu~ 
derivative. Selection on leu + . 

20 

Fig. 16C. The construction of pUR 3205, by which the gene 
coding for HGRF is inserted into a HARS-1- 
containing plasmid, which replicates autono- 
mously in H. polymorpha . Selection by trans- 
25 formation of a ura" mutant. 



Fig. 16D. The construction of pUR 3 209, by which the gene 
coding for HGRF integrates into the chromosomal 
MOX gene of H . polymorpha , fused to the 
structural MOX gene. HGRF is cleaved from the 
fusion protein by CNBr cleavage. Selection on 
immunological activity of HGRF. 

Fig. 16E. The construction of pUR 3 210, by which the gene 
coding for HGRF is inserted into a HARS-1- 
containing plasmid, fused to the structural MOX 
gene. Selection as in Fig. 16C. 
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Fig. 16F. The construction of pUR 3211, by which the 
gene coding for HGRF integrates iato the 
chromosomal MOX gene of a H. polymorpha leu" 
derivative, fused to the structural MOX gene. 
5 Selection on leu + . 

Fig. 17. The DNA sequence of the HGRF gene, derived 
from the published amino acid sequence. The 
gene is synthesised aB mentioned in Fig. 15 , 

10 but constructed in such a way that it could be 

inserted into fhe unique Kpn l site of the 
structural MOX gene. Therefore it was equipped 
with Kpn l sites on both sides of the gene, and 
KpnI-Hindlll fragments were used for sub- 

15 cloning. Synthesis will be as a fusion product 

to the MOX enzyme. The internal met (ATG) at 
position 82 is converted into a cys (TGT) . 
Translational start (met) and stop (***) codons 
are indicated. 

20 

Fig. 18A,B,C. The nucleotide sequence of the DAS struc- 
tural gene and its 5'- and 3' -flanking 
sequence . 

25 Fig. 19. Restriction map for the DAS-lambda clone. 

Only relevant restriction sites are indicated 
that have been used for subcloning and 
sequencing of the MOX gene. The open reading 
frame, containing the structural DAS sequence, 

30 and the Ml 3 subclones made, are depicted. 

Fig. 20. Identical sequences in -1000 region of DAS and 
MOX genes. 
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CLAIMS 

1- Process for preparing an oxidoreductase by 

culturing a microorganism under suitable conditions, 
optionally concentrating the enzyme and collecting the 
concentrated enzyme in a manner known per se, charac- 
terized in that a microorganism is used that has been 
obtained by recombinant DNA technology , and which is 
capable of producing the oxidoreductase. 

2. Process according to claim 1, characterized in 

that the microorganism is capable of producing at least 
one enzyme selected from the group consisting of 

(1) alcohol oxidases, 

(2) amine oxidases, including alkylamine oxidase and 
benzyl amine oxidase, 

(3) amino acid oxidases, including D-alanine 
oxidase, lysine oxidase, 

(4) cholesterol oxidase, 

(5) uric acid oxidase, 

(6) xanthine oxidase, 

(7) chloroperoxidase, and 

(8) aldehyde oxidase. 

3* Process according to claim 1 or 2, charac- 

terized in that the microorganism is a mould or yeast. 

4. Process according to claim 3, characterized in 
that a mould or yeast is selected from the group con- 
sisting of the genera Aspergillus , Candida , Geotrichum , 
Hansenula , Lenzites , Nadsonia, Pichla , Poria , 
Polyporus , Saccharomyces , Sporobolomyces , Torulopsis , 
Trichospora and Zendera . 

5. Process according to claim 4, characterized in 
that the mould or yeast is selected from the species 
Aspergillus japonicus , Aspergillus niger , Aspergillus 
oryzae, Candida boidinii, Hansenula anomala, Hansenula 



0173378 



polymorpha , Hansenula wingeii , KloecXera sp.22Cl and 
Pichia pastoris . 

6* Process according to any one of claims 1-5, 

5 characterized in that the microorganism is also capable 
of producing a dihydroxyacetone synthase enzyme, which 
promotes the formation of dihydroxyacetone from form- 
aldehyde. 

10 7. Use of an oxidoreductase prepared by a process as 
claimed in any one of claims 1-5 in an oxidation pro- 
cess. 

8. Bleaching composition including a fabric- 

15 washing detergent composition or hard-surface-cleaning 
composition having bleach activity, characterized in 
that it contains an oxidoreductase prepared by a 
process as claimed in any one of claims 1-5 and a 
substrate for that oxidoreductase. 

20 

9. Microorganism, obtainable by recombinant DNA 
technology and being capable of producing an oxido- 
reductase suitable for use in a process as claimed in 
claims 1-5. 

25 

10. Microorganism, obtainable by recombinant DNA 
technology and being capable of producing a dihydroxy- 
acetone synthase-enzyme suitable for use in a process 
according to claim 6, in addition to being capable of 

30 producing an oxidoreductase. 

11. Process for preparing a transformed micro- 
organism as claimed in claim 9, characterized in that a 
DNA sequence coding for an oxidoreductase together with 

35 one or more other DNA sequences which regulate the 

expression of the structural gene is introduced into 
the microorganism via an episomal vector or integration 
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in.the genome,, sueh ' th^jfch^ ^^ 

of producing the pxitfOiiredjuctase . t ^ ^to^sli/ps? ) ail 



12. Process f or n preparing tf^n ? f jpj^^rai cr o- ftr 
5 organism as claimed in claim 10^ cji^r^^^r^ed^ n^fjftajt 

a DNA coding for a <&ihydroxyacetone : $&P^§P§r-^£gf&% f i* 
together with one or more other DNA.* sequ^^ce^f J^f^jjv^ e 
regulate the expression of the structural gene is 
introduced into the microorganism via an-, episomal 
10 vector or integration in the genome, .such that^ the ; . 

microorganism is capable of producing the dxh^dxioxyr-. v 
acetone synthase— enzyme ( DHAS enzyme)*, r , 

13. DNA sequence coding for an oxidoreductase, 
15 characterized in that it is obtainable by recombinant 

DNA technology from natural and/ or cDNA and/ or cherr^x- 
cally synthesised DNA* 

14. DNA sequence according to claim 13, charac- « 
20 terized in that it codes for an alcohol oxidase* 



15.-. DNA sequence according to claim 14 r charac- 

terized in that it comprises the DNA sequence 1-1992 
(MQX gene) given in Fig. 11A + 11B encasing v ^he .poly- 
25 peptide 1-664 (MOX) , the amino acid sequence of which 
is given in Pig. 11A + 11B. 

16;' Combination of DNA sequences , cc?mpris^g r a . 

structural gene coding for an oxidoreductase .a,n<}, one or 
30 more other DNA sequences which regulatQ the.^pr^ssipn 
of the structural gene in a particular microorganism f or 
group of microorganisms. 

1 ' .a;: oD m £ £ 

17. Combination of DNA seque^Geg^gcgpgdigcj Ki q bI^ 

35 claim 16, characterized in that it CQmp^ij|ej J5 ^ v J.^|t 
part of the upstream DNA sequence ?1 ^o^bflj^^-lgpg^pr 
given in Fig. 11A and/or at least part of the down- 
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stream DNA sequence 1993 to about 3260 given in Fig. 
11B (regulatory regions of the MOX gene) . 

18. Combination of DNA sequences according to 

5 claim 17, characterized in that it comprises at least 
the polynucleotide -1052 to -987 of the upstream DNA 
sequence given in Fig. 11A. 

19. Combination of DNA sequences according to 

10 claim 17, characterized in that it contains a modified 
MOX promoter sequence which is obtainable by deletion 
of at least polynucleotide -105 2 to -987 given in Fig. 
11A. 

15 20. Combination of DNA sequences according to 

claim 16, characterized in that it comprises at least 
part of the upstream DNA sequence -1 to about -2125 
given in Fig. 18A + 18B and/ or at least part of the 
downstream DNA sequence 2107 to about 235 0 given in 

20 Fig. 18C (regulatory regions of the DAS gene). 

21. Combination of DNA sequences according to 
claim 20, characterized in that it comprises at least 
the polynucleotide -1076 to -937 of the upstream DNA 

25 sequence given in Fig. 18A. 

22. Combination of DNA sequences according to 
claim 20, characterized in that it contains a modified 
DAS promoter sequence which is obtainable by deletion 

30 of at least polynucleotide -1076 to -937 given in Fig. 
18A. 

23. Combination of DNA sequences according to 
claim 16, characterized in that it comprises a 

35 structural gene coding for an oxidoreductase of a 
higher eukaryote, a mould, or a yeast. 
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24. Combination of DNA sequences accordirg to 
claim 23, characterized in that it comprises a struc- 
tural gene coding for an oxidoreductase of a yeast of 
the genus Hansenula , preferably of the species H. 

5 polymorpha . 

25. Combination of DNA sequences according to 
claim 16, characterized in that the structural gene 
coding for an oxidoreductase encodes an alcohol 

10 oxidase. 

26. Combination of DNA sequences according to 
claim 25, characterized in that the structural gene is 
the DNA sequence 1-1992 (MOX gene) given in Fig. 11A + 

15 11B encoding the polypeptide 1-664 (MOX), the amino 
acid sequence of which is given in Fig. 11A + 11B . 

27. Combination of DNA sequences according to 
claim 16, characterized in that it also contains a 

20 structural gene coding for DHAS. 

28. Combination of DNA sequences according to 
claim 27, characterized in that it contains a struc- 
tural gene coding for DHAS having the amino acid 

25 sequence as given in Fig. 18B + 18C. 

29. Combination of DNA sequences according to any 
one of claims 16-28, characterized in that the DNA 
sequences have been modified, while retaining their 

30 coding function for an oxidoreductase or for their 

regulatory functions, by recombinant DNA technology. 

30. Combination of DNA sequences according to any 
one of claims 16-29, characterized in that it contains 

35 one or more DNA sequences that enable stable inheri- 
tance of said combination in the progeny of any partic- 
cular host organism. 



0173378 



31. Combination of DNA sequences suitable for the 

transformation of a microbial host to produce a 
specific enzyme or other protein which combination of 
DNA sequences contains a regulon, a structural gene 
5 coding for that specific enzyme or other protein and 
optionally a terminater, characterized in that a 
regulon is used selected from the group consisting of 
at least part of the regulon -1 to about -1500 of the 
MOX gene given in Fig. 11A or at least part of the 

10 regulon of -1 to about -2125 of the DAS gene given in 
Fig. 18A and modifications thereof that do not impair 
the regulon function, and optionally a terminater is 
used selected from the group consisting of at least 
part of the terminater 1993 to about 3260 of the MOX 

15 gene given in Fig. 11B or at least part of the ter- 
minater of 2110 to about 2350 of the DAS gene given in 
Fig. 18B and modifications thereof that do not impair 
the terminater function. / 

20 32. Combination of DNA sequences according to 

claim 31, characterized in that it is suitable for 
transformation of a Hansenula yeast, in particular a 
Hansenula polymorpha . 

25 33. Combination of DNA sequences according to 

claim 31 , characterized in that it is suitable for 
transformation of a Saccharomyces yeast, in particular 
Saccharomyces cerevisiae . 

30 34. Combination of DNA sequences according to claim 
31, characterized in that the structural gene coding 
for that specific enzyme or other protein contains DNA 
sequences derived from the structural gene coding for 
MOX (Fig. 11A + 11B) , which modify said specific enzyme 

35 or other protein, without impairing its functions, in 
such a way that said specific enzyme or other protein 
is translocated into the peroxisomes or equivalent 
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microbodies of said microbial host. 

35. DNA sequence coding for a dihydroxyacetone 
synthase-enzyme, characterized in that it is obtainable 

5 by recombinant DNA technology from natural and/ or cDNA 
and/ or chemically synthesised DNA. 

36. DNA sequence according to claim 35, charac- 
terized in that it comprises the DNA sequence 1-2106 

10 (DAS gene) given in Fig. 18B + 18C encoding the poly- 
peptide 1-702 (DHAS), the amino acid sequence which is 
given in Fig. 18B + 18C. 

37. Combination of a DNA sequence coding for a 

15 dihydroxyacetone synthase- enzyme and one or more other 
DNA sequences which regulate the expression of the 
structural gene in a particular microorganism or group 
of microorganisms. 

20 38. Combination of DNA sequences according to 

claim 37, characterized in that it comprises the DNA 
sequence according to claim 36 (DAS gene) and at least 
part of the upstream DNA sequence -1 to about -2125 
given in Fig. 18A + 18B and/ or at least part of the 

25 downstream DNA sequence 2107 to about 2350 given in 
Fig. 18C (regulatory regions of the DAS gene) and/or 
at least part of the upstream DNA sequence -1 to 
about -1500 given in Fig. 11A and/or at least part of 
the downstream DNA sequence 1993 to about 3260 given in 

30 Fig. 11B (regulatory regions of the MOX gene) . 

39. Combination of DNA sequences according to 

claim 38, characterized in that it comprises at least 
the polynucleotide -1076 to -9 37 of the upstream DNA 
35 sequence given in Fig. 18A or at least the poly- 
nucleotide -1052 to -987 of the upstream DNA sequence 
given in Fig. 11A, respectively. 
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40* Process for preparing a polypeptide, such as a 

protein or an enzyme, by culturing a microorganism 
under suitable conditions, optionally concentrating the 
polypeptide and collecting same in a manner known per 
5 s&, characterized in that a microorganism is used that 
has been obtained by recombinant DNA technology and 
carries a structural gene coding for the polypeptide 
concerned, the expression of which is under the 
control of a regulon, comprising a promoter and at 

10 least either the -1052 to -987 region of the MOX gene 
of Hansenula polymorpha CBS 4732, or the -1076 to -937 
region of the DAS gene of Hansenula polymorpha CBS 
47 32, or a corresponding region of other methyl otrophic 
moulds or yeasts, or an effective modification of any 

15 of these regions. 

41. Process according to claim 40, characterized 
in that the promoter is derived from the yeast 
Hansenula polymorpha . 

20 

42. Process according to claim 40 or 41, charac- 
terized in that the microorganism is a mould or yeast. 

43. Process according to any of claims 40-42, 

25 characterized in that a mould or yeast is selected from 
the group consisting of the genera Aspergillus , 
Candida , Geotrichum , Hansenula , Lenzites , Nadsonia , 
Pichia , Poria , Polyporus , Saccharomyces , Sporobolo - 
myces, Torulopsis , Trichospora and Zendera . 

30 

44. Process according to claim 43, characterized 
in that the mould or yeast is selected from the species 
Aspergillus japonicus , Aspergillus niger , Aspergillus 
oryzae , Candida boidinii , Hansenula a noma la , Hansenula 

35 polymorpha , Hansenula wingeii , Kloeclcera sp. 2201 and 
Pichia pastoris . 
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45. Process according to claim 44, characterized 
in that the microorganism is the yeast species 
Hansenula polymorpha , 

46. Process according to any of claims 40-45, 
characterized in that the structural gene concerned has 
been provided with one or more DNA sequences which 
translocate the gene product into the peroxisomes or 
equivalent microbodies of the microbial host. 

47. Process according to claim 46, characterized 
in that the DNA sequences concerned consist of the MOX 
gene or those parts thereof which are responsible for 
MOX translocation into the peroxisomes or microbodies. 
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Hg. 3 DNA sequence of the autonomous-replicating sequence HARS1 
from the methylotrophic yeast Hansenula polymorpha. The 
HARS1 represents a Sail fragment comprising 483 nucleo- 
tides. The dideoxy-sequencing method was employed. 



t 

(QTCGACTCCG CGACTCG6C6 TTCACTTTCG 

AACAGCCGTG CCCCAGGGAC CAGAAAGCCT 

CCGAGGATGA GACGACGATA ACGAGCACAA 

GACGAGCCAC ATCAGCAGGC TGTCAAGACT 

TACTCAAGAC GTTAGTAAAC TCCGTCTGCC 

GAATTACGAA CAATGTAGTC AAAAAAATTT 

TGAAACCAGT GAACTCCAAT AAATCCAGCG 

GGTCTAGGCT TGTTTCAGAG TACTACAAGC 
TTT G^fCGAC) 

HS3 



AGCTATTCAT CAACGCCGGA ATACGTCAGA 
ACTGGTGAGT ATGTTCTTTC GTGTGA7TTT 
CTCGGAGTCG GAGGACACGC TTATTGCGT1 
GAGTATAGGC CACAGAGCTG ATTCTGCTCA 
ACAATGCTGA CAGAGTATTA TAATAATAGT 
AGTAACAATA TGTCATGATG ACAGATTTGC 
GCTACCGCAT CCCAAGAGAA ACAGATCAGA 
TTTCCAGAAC TTAGCAATTC TCAAACGCGG 
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Fig. 8 

NH2~Ala-Ile~Pro~Asp-Glu-Phe-Asp-Ile-Ile-Val-Val-Gly- 

CCA GAC GAA TTC GA 
CCA GAT GAA TTC GA 



-Gly-Gly- * -Thr-Gly-Cys-Cys-Ile-Ala-Gly- * -Leu- 
-Al a- As n-Le u-Asp- As p-G 1 n- As n-Leu 
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CICCACCCCC 

AACCACCCCC 
-1431 

CACCTCCTTT 

ATTCACCATC 

CTTACTCTAC 

CTCCCCTCTC 

CTACTTCTTC 

AT7ACTTTCA 

TACACTCTCA 

CAATCCAACA 

TTCAACCACC 

AACCCCTITT 
-S31 

ACCCCACACA 
TCCACCTCCT 
TCCACTCCAC 
CCCTTCATCC 



TACCCACTCC 
-511 

CCATCCATCC 

TCTCAATCTC 

CACCTCCATC 

TAACCCCACC 

ACACCCCCCA 
-231 

CCCCTCAACC 
CTTTTTCCCC 
CCCACACCCT 



AC AAA AACAAA 

-11 

20 

CLY CYS CYS 
CCC TCC TCC 
40 

ILE CLU CLY 
ATC CAC OCT 

to 

ARC AIM NET 
ACA AAC ATC 
• O 

LEU ASK CLY 
CTC AAC CCC 
100 

asm pme LtU 

AAC ITT CTC 
120 

YEP SEX INK 

TCC ACC ACC 
140 

ASM ASM ABC 
AAC AAC AC A 
140 

TYR HO TMR 

TAT CCT ACC 
ISO 

ASP LEU CLU 
CAC CTC CAC 
200 

AS* LIU CLY 

CAC CTC CCC 
220 

CLM SEE LEU 
CAC ACC CTC 
240 

ALA YAL ALA 

CCT CTC CCC 
240 

THE PME A1C 

ACC TTC AC A 
2S0 

LEU CLM AEG 

CTC CAC ACA 



MET ALA 
ATC CCC 



AC A ACC ATCT CCTCCACCTC CTCCCCCATC ACCTTCTCCC 

ACCCCACCCT CCTTCCCCAC CACCCTCCCT CCCCACCCCA 

ACAACCTCCT CCCCAAACTC CACTCTCAOA TCAATCTCCT 

TTCTCCACCA CCCATCTCTC TTTCCACTAC AACCCTAATC 
-1301 

CCCAACACCT ACTTTCCCTC CCCCCCCATA ATCAACACCT 

ACCACCCCCC ACCTCTCCAC CCCCTCCATC ACCCCCTTCA 
-1201 

CCTCCCTCTA CCCCCCCCCC CTCACCATAC CCACATACAC 

TCACCTCCCC CACCATCCCC CACTCCCCAT CCAAATTTTT 

- 1031 

TCTCACCATC CAATCTAATC ACCTCCACCT TCCCATCTCC 
-1001 

ACCCCCACAT CTCCAACACC TCCCCCCTCT TCACAATCAC 
-»01 CACAAC CCCCCCTTTC CTCATCCCCC CCCCCTCCTC 
CCACACCCAC TCTCCTCAAC AACCTCCCAA CCCTCCCAAC 
ATTCCCCCCT CCCCCCTTTC CTCATTTCAA TCTTCTCCTC 

CCAACATITC CCCCTACCCC CCTTTTCCCT CAC ACTTTAC 
-701 

ACATGCCCTT CCTCTTCACC CCCTACACCA CCAACCCCCT 

A »oi TATCAG CCCATCTCC * ccctcttcct tcactcccta 

ACATCTCCAC ACTCCCCTTC CTCTCCTCCA TCCACCAATT 

TTCCACCCCA ACTTTTTAAA ACCCACTCCC TTTACCCCTC 

-*3l 

CCAACTCACC CCCTTCTCCA CCCCCAACCC AACTTTTCCC 
-401 

CTCTCATCTC ACCCTCTCTT TCCTCCCCTA CCCTACAACC 

C«CTACCCA CTCCTCYCTC TCCCTCCTAC CAC AAA ATC A 

TCTCCCAACT CCTCCCCTCT CCCACACCCT CTTTCTCCAC 

CCCCACAAAC TAAATTCTTA TCCTACCCTC CACCCACTCC 

TACTTCATCA CACATCCCCT CACCCCTCCC CCTAACTCTA 
-101 

CCATCTATAA ATACTCCTCC CACTCCACCC TCCTCACATC 

5 10 
ILE riO ASP CLU PME ASP 1LE ILE VAL YAL CLY 
ATT CCT CAC CAA TTC CAT ATC ATT CTT CTT CCT 



CCCCTAATCC 

CTTTCTC AAC 
-1401 

CCTCCCACCA 

TCTCCTCCTC 

T CTCTTTCTC 

CCCCCTCCTA 

CTCCTTCCCC 
-110} 

CCCCTCCTCC 

CATCCTTTTC 

CCCCACCTCC 

CTCCATCTAC 

CACCTCCACC 
-•01 

CATCACCACT 

CATCACCTCC 

CCCCACCACC 

CTCCACTCTC 

AATTGTTCCC 
-301 

CCCTAAAACT 

TTCCACCACC 

TCACCTTCCC 

CCACACCACC 

ACTCCAAATC 
-201 

CACATCCCCA 
CCCAACCCTC 
AATCTAAACT 
13 

CLY CLY SEE TH A 
CCA CCT TCC ACC 
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ILE ALA 

ATT CCC 

CLY CLU 
CCT LAC 

AEC LEU 
ACA CTC 

AEC AEC 
ACA ACA 

HET TYR 
ATC TAC 

ASP CLU 
CAC CAC 

ASP LEO 
CAT CTC 

CYS CLM 
TCC CAC 

ASP PUE 

CAC TTC 

AEC AEC 
ACA ACA 

PEE LEU 

TTC CTC 

YAL AEC 
CTC ACA 

ALA AEC 
CCC ACA 

SEE CLY 

TCT CCT 



CLY 
CCC 

ASH 
AAC 

Asr 

CAC 

ALA 

CCC 

TMR 
ACC 

LEU 
TTC 

MIS 
CAC 

ASP 

CAC 

*.YS 
AAC 

SER 

TCC 

ILE 
ATC 

TMR 
ACA 

LYS 
AAC 

ILE 
ATT 



AEC 
ACA 

ASM 

AAC 

SEE 

TCC 

ILE 

ATC 

AEC 
ACA 

LEU 
CTA 

CLY 
CCC 

PME 
TTC 

TMR 
ACA 

ASP 
CAT 

TMR 

ACC 

VAL 

CTC 

CLM 
CAC 

CLY 
CCT 



23 

LEU ALA 
CTC CCA 
43 

ILE ASM 
ATC AAC 
43 

LYS THE 
AAC ACC 
■ 3 

YAL PRO 

CTT CCT 
103 

ALA SEE 

CCC TCT 
123 

PEO LEU 
CCT CTC 
14) 

PHE ASP 

ITT CAC 
143 

LEU AEC 
CTC ACA 
113 

SEE MIS 

TCC CAT 
203 

SER ALA 

TCT CCC 
225 

SEE TME 

TCC ACC 
243 

PEO MET 
CCA ATC 
243 

ILE YAL 
ATT CTC 
2S3 

ALA ALA 
CCA CCT 



ASN 
AAC 
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ALA 

CCC 

CYS 
TCC 
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ATC 
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CCT 
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LYS 
AAC 
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ILE 
ATC 

MIS 
CAC 



LEU 
CTC 
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CCT 
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ACC 
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CCC 



SEE 

TCC 



ASP 

CAC 



TAP 
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ASM 
AAC 



ASP 

CAC 



LYS 
AAA 



PEO 

CCA 



ALA 
CCA 



ALA 
CCA 
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CCC 
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TCT 



PRO 

CCT 



SEE 

TCC 



MIS 
CAC 



ILE 

ATC 



CLU 
CAC 



CLU 
CAC 



TYE 

TAC 



ASP 

CAC 



LEU 
CTC 



CYS 

TCC 



LEU 
TTC 



30 

ASP 

CAC 

30 

VAL 

CTC 

70 

TYE 

TAC 

TO 

ILE 

ATC 

110 

TYE 

TAC 

no 

ILE 
ATC 
130 
LYS 
AAC 
170 
SEE 
TCC 
IfO 
MIS 
CAC 
210 
YAL 
CTC 
2 30 
LYS 
AAC 
230 
ASM 
AAC 
270 
CLY 
CCA 
2 tO 
AEC 
ACA 



CLM A5N 
CAA AAC 

TYE LEU 
TAC CTT 

SEE SEE 

TCC TCC 

LEU CLY 
CTT CCA 

ASP ASP 

CAC CAC 

CLU TME 
CAA ACT 

YAL SEE 

CTT TCC 

CLM CLY 
CAC CCA 

TEP LEU 
TCC CTC 

MIS PEO 

CAC CCA 

YAL ILE 
CTC ATC 

PRO LYS 
CCT AAC 

TME ILE 

ACC ATC 

SEE YAL 

TCC CTC 



LEU 

CTC 

PRO 

CCC 

AEC 
ACA 

CLY 
CCC 

TEP 

TCC 

TYE 

TAC 

PHE 

TTT 

ILE 
ATT 

LYS 
AAC 

THE 

ACT 

ILE 
ATC 

LYS 
AAC 

SEE 

TCC 

CLT 
CCC 



THE 
ACA 

CLY 
CCA 

PEO 

CCA 

CLY 

CCC 

CLU 
CAC 

CLM 

CAC 

CLY 
CCA 

PEO 

CCT 

TEP 

TCC 

MET 
ATC 

CLU 
CAC 

PEO 

CCT 

SEE 

TCT 

VAL 

CTC 



33 

VAL 

CTT 

33 

VAL 

CTC 

75 

SEE 

TCC 

95 

SEE 

TCC 

115 

SER 

TCC 

133 

AEC 

CCT 

133 

ASM 

AAC 

173 

VAL 

CTT 

193 

ILE 

ATT 

213 

AEC 

ACA 

233 

ASP 

CAC 

235 

VAL 

CTC 

275 

PEO 

CCT 

2Y3 

LYS 

AAC 



ALA LEU 

CCC CTC 

TYR PRO 

TAT CCT 

LYS ALA 
AAC CCT 

SER ILE 

TCC ATC 

CLU CLY 
CAC CCA 

PRO CYS 

CCT TCC 

TYR TMR 
TAC ACC 

VAL ASP 
CTC CAC 

ASM AEC 
AAC ACA 

ASM LYS 
AAC AAC 

CLY LYS 
CCC AAC 

SEE ARC 

TCC ACA 

LEU VAL 
CTC CTC 

PRO ILE 
CCA ATC 
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CLS m CLY LED ALA AIC riE ••• 

BAC ACC CCA CTT CCC ACA TtC I*A CCAC ACCTCCAACC ACATACCCCT TTTCACAACC 

2000 

CTCTTTCAAA ATACTTCTTT TTCTCCTTTA TATCCTTTAT CAACTCATCA CAT6AAAACC 
2090 

10AAAIACC0 ACTATACCAA AATTTAATCA AAATTAAATT AAATATTTTC TIACCCTATT 
ACTCACCTTC AAAATCCCCC CCCCTTCTAA CAACCTTCTC ATCATCCACA ACTACCACTC 
CTTTACCTCC AACCTCTACC ACTACCTCTC TCACCACCCA CCCAATCTCC "^TTTCAC 
CAACCATCAC ATCACCATTC CCCACATTCA OCACCTCAAC "«ACCTTC TCCTCATATC 
CCCTCCTCCT CCCCATCCAA JJJCACACTC wLtATCT CCCCACCTCA TCACCCATIT 
1AAACCCAAC ATTCCTCTCT TTCCTCTCTC TATCCCCCAC CACTCTATCT TCCACCACTT 
TCCCCCACAC CTCCACTATC CCCCWACA1 TCTCCATCCA AAAACCTCCA CTCTTAACCA 

ccacaacaac ccaatcttca aaaaccttcc ccaacatctt cctctcacca 2i?2ccACxc 

OCTCCCCCCA ACCCTCAACT CCCTTCC6CA CTCTCTA*AC ""cTCCTC CCACACACAA 

2 400 

CCCCATCATT ATCCCTCTCA 6 AC AC A AC AA CTACACCATC OACCCCCTCC ACTT2CATCC 

AC AC ACC ATT CTCACCCACC ACCCCCATCT CATCATCCAC AATATCCTCA ACCTTTCCCC 

TCCTTACTCC CACCAAAATC CCAACCCCCC CCCTCACACA AACCAAACCA TATTCCACAA 

AA7ATACCCC CACACACCAA AACACIACCA CTTTCACATC AACACACCCC W«CACATT 

TCCTCA1CTA CAACTCTACT TCTCCATCCC ACTCCACCCC CCCIAATCAA TTTTTAC6AC 

ACATTCCACC ACAACATC AC CCCCCCCAAC CTTCCAATTC TCACCCAAAT CAACACACCC 
2150 

TCCCCTTCTA AACCCCTCAT CCaCCCACAC CCTAACCCTC CCAAACACCC CCTCAACTAC 

CCCAACCCTO CACTTCCCAC AATTTCTCTT TTCACCCACC CAACCTCCTf TAAACCAAAT 

ATCCACCACC TCCACCTCCC CACAAAACCC ATTCACTCTC TCCCCAAXAC !«CTCTATT 

TTCCCCAACC ACTTTATCTT CAACAACTAC CAAATTCTAC li"ccCACT CCCCCCACCA 

CACACCCTTC TCCTCATTCT CAACAlCCTC ACCTC 
3250 
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XMOXA (fig. 6) 

isolate 2 A kb fragment 

|/ma ill 
isolate 380 bp fragment 



Fig, 12A 
Iran spo son Tn5 
Sa/I 

r X/na III 
isolate 1.1 kb fragment 



M13 mp9 



Sal I 
Mod III 




Sal I 

alkaline 

phosphatase 



alcohol precipitation 
J 



A MOX 4 (fig. 6) 
5a/ 1 



3- 



S'lGGC-^ 



NE0 7 



ACGTGCCA- 



HgiA\ 

isolate 2.4 kb fragment 



HgiAl 



-CTCCGGCCGCTT6 3 ' 

G AGGCCGGCG A AC T T A A 5 ' 
1 1 1 1 -i » 



NE0 6 NEO 8 
phosphorylate * mal " EcoRI 

NEO 6 and 7 with T4 polynucleotide 
kinase and rATP 



NEO 3. & 7 and 8 
, T4 ligase 

isolate 86 bp fragment 

isolate 1.5 kb fragment | T4 polynucleotide kinase and rATP 



T 



T4 ligase 



M13 mp9 

Isa/I 
1 EcoRI 



113 ! 



isolate 
vector 
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Fig. i:b 

Promoter MOX-Neomycinphosphotransf erase adaptor fragments 
NE03 5 ' CGGTGGTGACATCAATCTAAAGTACAAA 3* 

NE06 5 ' TCATTTTGTTTTTGTACTTTAGATTGATGTCACCACCGTGCA 3 • 

NE07 5 1 AACAAAATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTG 3' 

NE08 5 ' AATTCAAGCGGCCGGAGAACCTGCGTGCAATCCATCTTGTTCAA 3' 



3X 



pUR 3101 
Una Hi 

I alkaline phosphatase 
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pUR 3102 
\Xma i 



XMOX-4 (f ig. 6) M13mp19 

55/1 \sal\ 
Sac I sac I 



t 



Xrna II 
Sail 
Hin dill 




isolate 1.2 kb fragment 
I 



ligase 




isolate 2.7 kb fragment 



transformation 



M3V 

fir. 7 3 

<---- PROMOTER HOX/AAO ADAPTOR- 
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Hi nd I I I 
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A 8 2 

sssss sssss sssss sssss sssss sssss 

5 A 2 

sssss sssss sssss sssss sssss sssss 

602 

ssss: sssss sssss sssss sssss sssss 

SSSSS 8SS2S2 jffisg}* SSSSS SSSSS 

sssss sssss sssss sssil sssss SSSSS 

JCpnl 

SSSSS SSSSS SSSSS SSSSS SSSSS SSSSS 

sssss sssss sssss sssss sssss sssss 

902 

sssss sssss sssss sssss sssss sssss 
sssss sssss sssss sssss sssss sssss 

1022 

sssss sssss sssss sssss sssss sssss 



C AG 

CTCACCT 
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5acl 



Fig, 14A 

#mal c 

5a/ 1 




isolate 1 kb fragment 



5a/ 1 
tf/ndlil 



Hindi! P l, 9 ase 
1 I ▼ 



adaptor 




5' prMOX-AAO 
C » i — 



3' 



ACGTG- 



s3* 



CAGCT* 
• . 1 



Sac I 



partial Sat I 



isolate 

linearized plasmid 

L 



%/AI |TAiigase Sa / , 

isolate 73 bp fragment 



pUR 3102 

[Sa/ I 
IHgi Al 



isolate 1.4 kb fragment 
containing promoter 




transformation 
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Sac 



HindlW 




pUR 3003 



i partial 5a/ 1 
klenow poly 



merase 



k EcoR\ 

Hpa\ 
Sal\ 

klenow polymerase 

isolate Leu 2 containing 
fragment 



isolate plasmid 
linearized on one of 
the Sail sites 




EcoR\ 



transformation 
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Fig. 16A 




Hin dill 



Hin dill 



|Sa/l 
[Hin dill 

^TA ligase 
,Sa/l 




Sa/I 



Sa/| 
Hin dill 




5. prMOX-HGRF adapter 



oX .ACGTG, 

Sac I ^Al 

'£co Rl 



TA ligase 



-A 3 ' 

-TTCGA 5 ' 

• , 1 

Hin dill 



isolate 110 bp fragment 



PUR3102 
\Sai I 
^Hgi A I 

isolate 1.4 kb fragment 
containing promoter 



Hin dill 



^TA ligase 
^Klenow polymerase 
[TA ligase 
HoiAl 

Hin dill 
Sail 

Sac I 
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Fir.. 163 



PURS 528-03 

Hpa\ 
\Sal I 

isolate Leu 2 
containing 
fragment 



pUR 3203 
Sa/I 



1= 



Hin dll 



HgiAA 



ligase 
^klenow polymerase 
T4 ligase 



Hind\\\ 




EcoRl 



ECO ,y, t-» c 

■ Sac I 
I Sacl 
transformation 
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Fig. 16C 



P HAR5 I 
EcoRI 

I partial HindUl 

isolate 

linearized plasmid 
containing HARS I 



Pvu)\ 



1 



TA ligase 



PUR 3203 

\Eco Rl 

^partial Hin dill 
isolate 4kb fragment 

MOX U TER n S PrM0X ' HGRFa " d 

_J 
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AMOX U 

\Sph\ 
[Kpn\ 

isolate 2.3 kb 
fragment 



Sph\ 
Hin dill 




Kpn I 



XMOX A 
\Kpn\ 
\Sac\ 

isolate 

1.5 kb fragment 



| Sac I 

| partial Kpn I 

isolate linearized fragment 
containing prMOX.MOX and HGRF 




cKpn\ 
Kpn I 



Eco Rl 



Sac! 
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Fir.* j &u 



p HARS I 

I partial Hin dill 
Eco Rl 



pUR 3209 

partial Hin dill 
I partial Eco Rl 



isolate 

linearized plasmid 
containing HARS I 
L 



isolate 4kb fragment 
containing pr.MOX, MOX 
HGRF and MOX TERM 
J 




04 
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Fin. 16? 

pURS 528-03 P UR 3209 

H F>a I partial Kpn I 

I klenow 

klenow polymerase 

polymerase + 

isolate Z.et/2 
containing fragment 



Kpn\ 



Sph I 
W/7 dill 




FcoRI 



EcoRI 

Sac I 
r Sph\ 

transformation 



1 
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Pig.* 18A 

C CATCCACCTC 

CTTCCCCAAT CATTCACCTC CTCCACCCAA AACCCCTCTT TTCGCCAAAa'aAAGCCCACC 
CTTCATAACT CCCCACCCCA TATTTCAAAC AACACCCAAT AACAAAAAAA CCTCAATCAA 
ATCCCCCAAA CCATACCACT TATTACCATA AACAAAAAAA AAAAAAATCT ATT^CCTCTT 
ATTATAATTA CTTCAATAAT TTCATAACCA TCATCCTTCG CCCCCCTATT CTCATCACTC 
CTCCCTCTCC AACACCTAAA JCCACTTTCC TCAACAACCT CTTTCCTCAC TTCCCACACA 
ACTTTCCATT "CCCTCTCC AACACCACCA CAAAACCTAC ACCTCCTCAA AAACACCCTC 
TCCATTACCA CTTCACCACC CTACACCACT TCAACAACAT CATTCAAGAA AACAAATTCA 
TTCAATCCCC CCACTTCTCC CCCAACTACT ACCCCACCTC TCTCA AACCT GTGCAACACC 
TCCCCCAACT CATCAACACA ACCTCTATTT TCCACATTCA TATCCACCCT CTCAACACCC 



TCAAG AACAC CAACCTCCCA GCCCCATTCC TCTTTATTTC TCCTCCCTCC ATCCAACACC 



TCAAGAACAC fCTCCACACC CCTCCAACAC ACACCCCTCA ATCTCTTCCC AACCCCCTTC 
CTCCTCCATC TCCGCACATC CACTACCCCA CGCCACTCGA CACCACAACC TCATTCTCAA 
CCATCACCTT CACAACCCCT ACTCTCACCT CAACCACTTC ATTITCCCCC ACCc'cATCTA 



ACCATTCATA AATTTTTAAT ATCTACACCT CTCATACCCC ACAGTATCTC CTCCAACCTT 



• 1304 

GCCTCAACCT TCTCCTCTTC ATCCTCCTCA ACACTCATGG CATCCACCTC CTGCTCCTTT 
TGCTCCACCC JCGCATATAT CTCCCCATAC AGCTTCACTT GGATTTTCAT CAAACTCTCA 
AACGTACCGT CCACCACTCA CAGTCCCACC CCAATCAACT CCTCCATTTC CTTCTTCACC 
CCTCTCTTCA TCTCCCTCTA CATATTTTCT GCCTCCTCCT ACTCAACTIT CAACTTCTCC 
ACCTTCTCCA CGCTCTTCTC TAACTCCTCT CTTTTCTCCC TGMATCCTC CTCCCTCACC 



TCTCCCTCAA TCCCTTCCTA CTCCCTCTCC ACCTTCGAAA GCTTCAATCG TCAAACCTCC 



■»5* 



TAATCCACCT TTTTCCCTGC CCCCTTCTTC ATCACCTTCT TCATCTCCTC CTTGTACTTC 
TTCACCTCGT TAATCCCCTC CACCACCCTC ATCCTCATTC CCTCCACAAT TICTGCCAGA 
ATATTCTCTT TCATCTCTTC CACCATCTGC AGATAATTCA GACAAATACC ATCTCTCCCC 
TTCACCTTCT CCTCTTCTGG CCCTTCCCCA fCTTCCCACC CCTTATCAGC CTTGAGCTCA 
AACCTATACT CTCCCTAAAA CCACTCCACT Ct!cTAGCCA TATTTATCIG AGTCTCCACC 
AGATTCTCCG AAATTCCCCA CAAAACGCCC TACITCCTGC TCCAGCTCCT TGCTCTAACT 
CTCGACTTTC CCCAAATTGG CCTCCTCCAC GTCAAACTCA GGATCAACAG ACGCCTCACC 
TTTCTTTCTC CCTACTATCA CATCTGCTCC CCCACCATTC ACACCTTTTT TAAACCCAAC 



CCATCACATC TCCACCAAAC CCTCCTITCC CCCAGTTAAA TATTTTTCCC TATCTACCAC 



-404 

ACATCTTTCC ACCCTCCCGT CCCGTCCATC GCAAAATATT ACCCCACCAA CAACCACTTC 
CTTCCCTTAC CCACCACCCT GCCCAAGCCT TTTTCCCCGC TCTACACAGG CCCAATGAAA 
TCTCGGCGCA ATCTCAAACC CATCAAACCC ACCACACTGC CAACAACCTC ACTCCACTAT 



-20* 
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183 



TTTTTTTTTC 
CCAATTTACC 
CTCCTTCCCC 



TCTCTTTTCTT 
-14 



1 

MET 

CCC ATC 



CLH AEC ILE 
CAC AC A ATC 

CLY HIS PEO 

CCC CAC CCC 

LEU LYS TYE 
CTC AAA TAT 

CLY HIS VAL 

CCT CAC CTC 

MET ALA CLH 
ATC CCC CAC 

CLU ILE CLU 
CAA ATC CAC 

SEE VAL CLY 

TCT CTT CCT 

ASP ILE ILE 
CAT ATC ATC 

ALA LED CLO 
CCT CTC CAC 

AS? ASH ASM 
CAC AAC AAC 

ALA LYS PHE 
CCC A AG TTC 

ALA THE ILE 
CCT ACC ATT 

ASH CYS ARC 
AAC TCC ACA 

ASH ALA LED 

AAC CCT CTC 

CLH LYS PRE 
CAC AAC TTC 



ASP LYS LEO 

CAC AAC CTC 

CLO CLY CLH 

CAC CCC CAC 



20 
ILE 
ATC 
AO 1 
CLY 
CCC 
60 
ALA 
CCT 
80 
CYS 
TCT 
100 
LED 
CTC 
120 
HIS 
CAC 
140 
LEU 
CTC 
160 
THH 
ACC 
ISO 
SEE 
TCC 
200 
CLH 
CAC 
220 
LYS 
AAC 
240 

VAL 

CTC 
260 
THE 
ACT 
2S0 
CLY 
CCC 
300 
TYH 
TAC 
320 
VAL 
CTC 
340 
CLU 
CAG 
360 



TACTC AAATA CCCTATCCTC CTCTCCCTCC CCTCATACCT 
CTCCTTCCAC CCATTCACCC CCCACTCAAC ll^CCTCCC 
ACCAAAACCC CTATAAATAC ««CCATCC CTCTGCTACA 

s is k in ii: « - « « - 



LYS 
A AG 

SEE 

TCC 

PRO 

CCC 

LED 
CTC 

LYS 
AAG 

ASP 
CAC 

ALA 

CCC 



TYR CLY 
TAC CCT 

ALA MET 
CCC ATC 

ASH ASP 
AAC CAC 

PHE CLH 
TTC CAC 

SEE TYH 

TCC TAC 

ALA VAL 

CCC CTC 

ILE ALA 
ATA CCC 



ASH 
AAC 

ILE 
ATC 

VAL 

CTC 

ALA 

CCC 

LYS 
AAG 

VAL 
CTC 

SLtJ 
CAC 

ILE 
ATT 



LYS VAL 
AAC CTC 

«E LED 

TCC CTC 

CYS CYS 
TCC TGT 

CYS ASH 
TCC AAC 

ALA LEU 

CCC TTC 

ILE CLY 
ATT CCA 

ASP CLY 
CAC CCT 

"O CLH 

CCC CAC 



ALA 

CCC 

ME 

TTT 



CLU TUP 
CAA TCC 

LEU ALA 
TTC CCC 



25 
ARC ALA 
CCT CCT 

45 
CLY ALA 
CGC CCC 

65 
*R0 ASH 
CCT AAC 

85 
TYH ILE 
TAT ATC 
105 
HIS SEE 
CAC TCC 
125 
CLO VAL 
CAC CTC 
145 
THE LYS 
ACC AAA 
165 
TYH CYS 
TAC TCC 
185 
ALA CLY 
CCC CCC 
205 
ASP CLY 
CAC CGC 
225 
TEP ASH 
TCC AAC 
245 
CLU TYE 
CAC TAC 
265 
SEE CLY 
TCC CCT 
285 
VAL AHC 
CTC CCC 
305 
ASP VAL 
CAC CTC 
325 
LYS SEE 
AAC ACT 
34 5 
AHC MET 
CCC ATC 
365 



CTAAAGCGCT 
CTACCATCCC 
CAGTCATCTC 
15 

ASP CLU CLH 
CAC CAA CAA 



HIS 
CAC 



LEU VAL LEU 
CTT CTC CTC 

MET ALA ILE 
ATC CCT ATC 



TYE PHE ASH 
TAC TTC AAC 



30 
ASP ILE 
CAC ATT 

50 
CLY ILE 
CCA ATT 

70 



PHE CLH HIS 
TTC CAC CAC 

ASH ASP PHE 
AAT CAC TTC 

THE THE CLY 
ACA ACC CCC 

ASH LEU ALA 
AAC CTC CCT 

MET VAL CLY 
ATC CTT CCC 

HIS MET CLY 
CAC ATC CCC 

SEE VAL ASP 
ACT CTT CAC 

VAL ILE CLU 
CTC ATC CAC 



ALA CLH ALA 

CCC CAC CCC 

ALA ALA PRE 

CCT CCC TTC 

CLU LEU LYS 
CAG CTC AAA 

TYE ASP PHE 

TAC CAC TTC 

LEU VAL ALA 
CTC CTC CCC 

AEC CLY CLU 
ACA CCC CAC 



AEC ASP 
ACA CAC 

90 
LED TYE 
CTC TAC 
110 
BIS SEE 
CAC TCC 
130 
FRO LEU 
CCC CTC 
150 
ALA THH 
CCC ACC 
170 
ASP ALA 
CAT CCC 
190 
LEU ASP 
CTC CAC 
210 
ILE ALA 
ATT CCC 
230 
VAL CLO 
CTC CAC 
250 



VAL CLU 
CTC CAG 

ALA LED 
CCT CTC 

AEC PHE 

ACC TTT 

CLY LEU 
CCT CTC 

LEU CYS 
CTC TCT 

CLY CLH 
CCC CAC 

TYE ASH 
TAC AAC 

CYS LEU 
TCC TTC 



CLH TYH 
CAC TAC 

TEP LYS 
TCC AAA 

VAL LEU 
CTC CTC 

LYS SEE 
AAG TCC 

PEO CLY 

CCC CCT 

CLY ILE 
CCT ATC 

LYS PEO 
AAG CCC 

CLH CLU 
CAC CAC 



ASH LEU 
AAT CTC 

ASH THE 
AAC ACC 

ASH ALA 
AAC CCT 



CLU LYS 
CAG AAC 
270 
CLU ASH 
CAG AAC 
290 
ILE LYS 
ATC AAC 
310 
'HE LYS 
TTC AAC 
330 
LYS TYE 
AAC TAC 
350 
LEU PEO 
CTC CCA 
370 



HIS AEC 
CAC ACA 

HIS CYS 
CAC TCT 

TYE CLY 
TAC CCC 

CLU LYS 
CAC AAC 

VAL LYS 
CTC AAC 

LYS ASH 
AAG AAC 



ILE VAL 
ATT CTC 

CLU ASP 
CAC CAC 

SEE CLU 
TCC CAC 

«0 THE 
CCA ACA 

ALA ALA 

CCT CCC 

MET ASH 
ATG AAC 

FEO ALA 

CCC CCC 

ALA TYE 

CCC TAC 

TEP LYS 
TCC AAC 



35 
CLY 
CCA 
55 
TYE 
TAC 
75 
SEE 
TCC 
95 
MET 
ATC 
115 
HIS 
CAC 
135 
SEE 
TCC 
155 
CLY 
CGC 
175 
CLY 
CCC 
195 
LEU 
CTC 
215 
ILE 
ATC 
235 
ASP 
CAC 
255 
LED 
CTT 
275 
HIS 
CAC 
295 
PEO 
CCC 
315 
CLU 
CAC 
335 
PEO 
CCT 
355 
SEE 
TCC 
375 



CLY 
CCA 

THE 
ACC 

ASH 

AAC 

THE 

ACC 

PEO 

CCA 

ASH 
AAC 

PHE 

TTT 

PEO 

CCT 



TYE 
TAC 

SEE 

ACT 

VAL 
CTC 

ILE 
ATC 

CLY 
CCT 

ALA 

CCC 

CLY 
CCC 

CLU 
CAC 

PHE 

TTC 
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Fig. Ufa 

LEU FRO GLN C LN CLU FHE THIt CLY u» at * 

CTC CCC CAC C AG G A A TTC ACC tcl clc tcT CCT lei ill rrr * LA CUJ " U VAL 

380 UC CCT CCT AC « GCC CCT CCC AC A GAG CTT CTC 

ARC ALA LEU CLY CLK ASH CYS LYS SER VAL lit ai a pt * lit 395 

**** ccc CIC nt CAC AAC * cc - c |S Si iS SK SI c rll £1 c * c ;g ™ »; 

Cfc A AT CAC K III X* Jft ffi «« g« gj »0 .» LEU SER THR 2£2 „S 

420 TAT TTC ATG CAC C CC TCC CTC TCC ACC CAC TCT 



440 

S ALA 

: ccc 

460 
: MET 
: ATC 

460 

i I LE 
ATC 
500 
VAL 
GTC 
520 
SER 
TCT 
540 
SER 
TCC 
560 
ALA 
CCC 
580 
ALA 
CCA < 
600 



445 

i ALA 

: ccc 

465 
ALA 

: cct 

485 

HIS 
CAC 
505 
LEU 
TTC 
525 
CLY 
CCC 
545 
CLU 
CAC 
565 
LEU 
CTC 
585 
ALA 
CCA 
605 



?" 625 



640 



410 Ai5 
FRO 

CCC 
430 

* CLY 
: CGA 

450 
f THR 

: acc 

470 
i ARC 
: ACA 

490 
I CLU 

• CAC 
510 

t ALA 
' CCA 
530 
ALA 
CCC 
550 
LEU 
CTC 
370 
ASK 
AAC 
590 
LYS 
AAC , 
610 
ASF I 
CAC < 
630 
THR 1 
ACC < 
650 





A 3 5, 
^ J J 




i Rig 


ATA 
AJ.A 


MET 


CAT 


CCC 


ATC 




A ft C 
* D j 




ILE 


THR 


SER 


ATC 


ACC 


TCG 




475 




LEU 


GLN 


CLU 


CTC 


CAG 


GAG 




495 




CLY 


FRO 


THR 


GCC 


CCT 


ACG 




515 




TYR 


MET 


ARC 


TAC 


ATC 


AGA 




535 




FRO 


FHE 


SER 


CCA 


TTC 


ACC 




555 




ALA 


CLN 


ARC 


GCG 


CAG 


AGA 




575 




GLN 


ILE 


ILE 


CAG 


ATT 


ATT 




595 




ARC 


LYS 


THE 


ACA 


AAC 


TTC 




615 




ILE 


CLY 


TYR 


ATC 


CCC 


TAT 




635 




ASP 


CLY 


HIS 


CAC 


CCC 


CAC 




655 





is s s a m m ™ js s » as ss a? s is a a s; 2 

is B! K S! % IS ffi s ffi ;» gj ™ ;H Si - s IS £ 13 

i" % s si in 2; s: s s 1:5 ;r c •» s «■ «■ •» »» ss ..0 

7 00 CC CAA ACA CAC CCT TTG CTC CTC CAC CCA CTT CCT 

CLY FRO CLU CLY LYS ALA *** 

CCA CCT CAA CCA AAA CCC ™ CCACCAT AAACTAAATA AGCTCTGATT AACTAACATC 

AATAACTTCT TTCTCTCTCA ATCCCACCCC ACAAIAACCC CACAAATAAA ACTTICACAC 
TTCCCTCACA AACTCTCCAC CCCCACCCCA CTCACTCTTT CCCCCCCTCC "cSciCCCC 
ACACCCAIAT TICCCACCCA ACACAAACCA TTCC AC A AG G CCT^CCTCCC CATACCAAAT 

ACAATCCATC CCATCC 
2350 
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Fig. 20 

Identical sequences in -1QQQ region of DAS and MOX genes 



DAS -1076 



TAGATATTTTCTGCCTCGTCGTACTCA— 54N— GTGTGATG— 8N— TCACC— 9N- 
* ** ***** ************* * ******** ***** 

* ** ********* *********************** 

TCGAAATTTTTGCCGTCGTCGTACAGTGTGATGTCACC 
MOX -1052 

DAS _ 93? 

ATCGCTTCGTACTCGCTCTGCAGCTTCGA 
**** * *** * * ************ 

**** **** ** ********* *** 

ATCGAATGTAATGAGCTGCAGCTTGCGA 
MOX -987 
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In this patent specification a Pig. 3 was present ai*Hr^ 
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Fig. 3 (amended) 



?!?L S ^ UenC fv°, f the au tonomous-replicating sequence HARS1 



(_G)rC6ACTCCC 6C6ACTC6GC GTTCACTTTr r/irrT/i-r-r. T 



